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TECHNICAL FIELD 

This invention relates to compositions and methods and products of 
Pharmaceutical Proteins, Human Therapeutics, Human Serum Albumin, Insulin, Native 
Cholera Toxic B Submitted On Transgenics Plastids, containing transformed plastids. 



This invention relates to several embodiments which are disclosed herein in 
several specifications and corresponding figures titled as Pharmaceutical Proteins, 
Human Therapeutics, Human Serum Albumin, Insulin, Native Cholera Toxic B 
Submitted On Transgenics Plastids presented as one patent application and a set of claims 
25 thereof. 



NON-OBVIOUS NATURE OF INVENTION 



Despite the potential advantages of chloroplasts for biopharmaceutical production, it was 
30 not obvious that expressed' pharmaceutical proteins in plastids would assemble in this organelle. 
Prior to this patent application there were no published reports of biopharmaceutical proteins 
expression in chloroplasts. Indeed, there were valid reasons to suggest that such expression 
would be problematic Proinsulin contains both a and P chains and the C-peptide that connects 
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them. It is synthesized as a pre-proinsulin by the pancreas. After proper folding, disulfide 
bridges are established between a and p chains. NtH reviewers noted that chloroplasts 
expression of high levels of properly assembled proinsulin was unanticipated. Nor was it 
obvious, as pointed out by MH reviewers, that proinsulin expressed within plastids would be 
5 fully functional Prior to this invention, there was no report of expression of proinsulin within 
plastids. 

Similarly, prior to this invention, there was no report of expression of Human Serum 
Albumin within plastids. Human serum albumin is a globular protein of 66.5 kDa in size that 
should be properly folded and stabilized with seventeen disulfide bridges. Each human serum 

0 albumin consists of three structurally similar globular domains and the disulfides are positioned 
in repeated series of nine loop-link-loop structures centered around eight sequential Cys-Cys 
pairs. HSA is initially synthesized as pre-pix>-albumin by the liver and released from the 
endopiasmatic reticulum after removal of the armnoterminal prepeptide of 18 amino acids. The 
pro-albumin is further processed in the Golgi complex where the other 6 anunoterminal residues 

5 of the propeptide are cleaved by a serine proteinase. This results in the secretion of the mature 
polypeptide of 585 amino acids. It was likewise unanticipated that fully assembled HSA would 
be synthesized in large amount within plastids. 

Vibrio cholerae causes diarrhea by colonizing the small intestine and producing 
0 enterotoxins, of which the cholera toxin (CT) is considered the main cause of toxicity. CT is a 
hexameric AB5 protein having one 27KDa A subunit which has toxic ADP-ribosyl transferase 
activity and a non-toxic pentamer of 1 1.6 kDa B subunits that are non-covalently linked into a 
very stable doughnut like structure into which the toxic active (A) subunit is inserted. The A 
subunit of CT consists of two fragments — Al and A2 which are linked by a disulfide bond The 
5 enzymatic activity of CT is located solely on the Al fragment. The A2 fragment of the A subunit 
links the Al fragment and the B pentamer. CT binds via specific interactions of the B-subunit 
pentamer with GM1 ganglioside, the membrane receptor, present on the intestinal epithelial cell 
surface of the host. The A subunit is then translocated into the cell where it ADP-ribosylates the 
Qs subunit of adenylate cyclase bringing about the increased levels of cyclic AMP in affected 
0 cells that is associated with the electrolyte and fluid loss of clinical cholera. However, the B 
subunit, when administered orally, is a potent mucosal immunogen which can neutralize the 
toxicity of CT holotoxin by preventing its binding to intestinal cells. To achieve this effect, the 
B-subunit must be assembled in a pentameric form and disulfide bridges should be established 
among the subunit structures. It was not obvious that plastids could express and assemble 
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pentameric CTB. There has been no prior report of expression of CTB or its assembly within 
plastids. 

There was no certainty that biopharmaceuticals would assemble normally in chloroplasts 
5 or that they would retain their functionality in the prior art. Indeed, there might have been 

unforeseen deleterious effects of high-level expression of biopharmaceuticals in chloroplasts on 
plant growth or development that were not apparent from the experiences with other transgenes. 
The pH and oxidation state of the chloroplast differs from that of the human blood or bacterial 
cell in ways that might inhibit or prevent proper folding and assembly. 

10 There are examples of protein complexes in the chloroplast in which all the subunits are 

native to the plant, the ribosome being an example. However, the expression and assembly in 
transformed chloroplasts of heterologous proteins into multi-protein complexes has not been 
reported until the present invention. There is a single example in the literature of an inter-chain 
disulfide bond in plant chloroplasts, and that is between neighboring large subunits of the 

15 enzyme ribulose-1, 5-biphosphase carboxylase/oxygenase. The expression and assembly in 

transformed chloroplasts of functional proteins consisting of different protein chains, including 
disulfide bonds between different subunits, as represented by expression and assembly of a 
mammalian protein has never been demonstrated until the present invention. 
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PRODUCTION OF PHARMACEUT IC AT, PROTEINS 
IN TRANSGEN IC CHT O ROPL ASTS 

RELATED APPLICATION 

This patent application claims the benefit of U.S. Provisional Application No. 
60/185,987, filed March 1,2000. This earlier provisional application is hereby incorporated 
by reference. 

FIELD OF THE INVENTION 

This invention relates to the production of pharmaceutical proteins in transgenic 
chloroplasts. More particularly, this invention relates to the production of human insulin in 
tobacco plants. 

BACKGROUND 

Research efforts have been made to synthesize high value pharmacologically active 
recombinant proteins in plants. Recombinant proteins such as vaccines, monoclonal 
antibodies, hormones, growth factors, neuropeptides, cytotoxins, serum proteins and 
enzymes have been expressed in nuclear transgenic plants (May et al., 1996). It has been 
estimated that one tobacco plant should be able to produce more recombinant protein than 
a 300-liter fermenter of E. colL In addition, a tobacco plant produces a million seeds, 
thereby facilitating large-scale production. Tobacco is also an ideal choice because of its 
relative ease of genetic manipulation and an impending need to explore alternate uses for 
this hazardous crop. 

A primary reason for the high cost of production via fermentation is the cost of carbon 
source co-substances as well as maintenance of a large fermentation facility. In contrast, 
most estimates of plant production are a thousand-fold less expensive than fermentation. 
Tissue specific expression of high value proteins in leaves can enable the use of crop plants 
as renewable resources. Harvesting the cobs, tubers, seeds or fruits for food and feed and 
leaves for value added products should result in further economy with no additional 
investment 
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However, one of the major limitations in producing pharmaceutical proteins in plants 
is their low level of foreign protein expression, despite reports of higher level expression of 
enzymes and certain proteins. May et ai. (1998) discuss this problem using the following 
examples. Although plant derived recombinant hepatitis B surface antigen was as effective 
5 as a commercial recombinant vaccine, the levels of expression in transgenic tobacco were 
low (0.01% of total soluble protein). Even though Norwalk virus capsid protein expressed 
in potatoes caused oral immunization when consumed as food (edible vaccine), expression 
levels were low (0.3% of total soluble protein). A synthetic gene coding for the human 
epidermal growth factor was expressed only up to 0.001% of total soluble protein in 

10 transgenic tobacco. Human serum albumin has been expressed only up to 0.02% of the total 
soluble protein in transgenic plants. 

Therefore, it is important to increase levels of expression of recombinant proteins in 
plants to exploit plant production of pharmacologically important proteins. An alternate 
approach is to express foreign proteins in chloroplasts of higher plant. Foreign genes (up to 

15 1 0,000 copies per cell) have been incorporated into the tobacco chloroplast genome resulting 
in accumulation of recombinant proteins up to 30% of the total cellular protein (McBride et 
aL, 1994). 

The aforementioned approaches (except chloroplast transformation) are limited to 
eukaryotic gene expression because prokaryotic genes are expressed poorly in the nuclear 

20 compartment. However, several pharmacologically important proteins (such as insulin, 
human serum albumin, antibodies, enzymes etc.) are produced currently in £. coli. Also, 
several bacterial proteins (such as cholera toxin B subunit) are used as oral vaccines against 
diarrheal diseases. Therefore, it is important to develop a plant production system for 
expression of pharmacologically important proteins that are currently produced in 

25 prokaryotic systems (such as E, coli) via fermentation. 

Chloroplasts are prokaryotic compartments inside eukaryotic cells. Since the 
transcriptional and translational machinery of the chloroplast is similar to E. coli (Brixey et 
al., 1 997) 7 it is possible to express prokaryotic genes at very high, levels in plant chloroplasts 
than in the nucleus. In addition, plant cells contain up to 50,000 copies of the circular plastid 

30 genome (Bendich 1987) which may amplify the foreign gene like a "plasmid in the plant 
cell," thereby enabling higher levels of expression . Therefore, chloroplasts are an ideal 
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choice for expression of recombinant proteins that are currently expressed in E. coli (such 
as insulin, human serum albumin, vaccines, antibodies, etc.). We exploited the chloroplast 
transformation approach to express a pharmacological protein that is of no value to the plant 
to demonstrate this concept, GVGVP gene has been synthesized with a codon preferred for 
5 prokaiyotic (EG121) or eukaryotic (TG131) expression. Based on transcript levels, 
chloroplast expression of this polymer was a hundred-fold higher than nuclear expression 
in transgenic plants (Guda et al., 1999). Recently, we observed 16.966-fold more tps 1 
transcripts in chloroplast transf ormants than the highly expressing nuclear transgenic plants 
(Lee et al. 2000, in review). 

10 

DETAILED DESCRIPTION 

In our research, we use insulin as a model protein to demonstrate its production as a 
value added trait in transgenic tobacco. Most importantly, a significant advantage in the 
production of pharmaceutical proteins in chloroplasts is their ability to process eukaryotic 

15 protein, including folding and formation of disulfide bridges (Dreshcher et al., 1998). 
Chaperonin proteins are present in chloroplasts (Verling 1991 ; Roy 1989) that function in 
folding and assembly of prokaryotic/eukaryotic proteins. Also, proteins are activated by 
disulfide bond oxido/reduction cycles using the chloroplast inicredoxin system (Reulland and 
Miginiac-Maslow, 1999) or chloroplast protein disulfide isomerase (Kim and Mayfield, 

20 1997). Accumulation of fully assembled, disulfide bonded form of antibody inside 
chloroplasts, even though plastics were not transformed (During et al. 1 990), provides strong 
evidence for successful assembly of proinsulin inside chloroplasts. Indeed, we observed 
fully assembled heavy and light chains of humanized Guy's 13 antibody in transgenic 
tobacco chloroplasts (Panchal et al. 2000, in review). Such folding and assembly eliminates 

25 the need for post-purification processing of pharmaceutical proteins . Chloroplasts may also 
be isolated from crude homogenates by centrifugation (1500 X g). This fraction is free of 
other cellular proteins. Isolated chloroplasts are burst open by osmotic shock to release 
foreign proteins that are compartmentalized in this organelle along with few other native 
soluble proteins (Daniel and McFadden, 1987). 

30 GVGVP is a PBP made from synthetic genes. At lower temperatures the polymers 

exist as more extended molecules which, on raising the temperature above the transition 
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range, hydrophobically fold into dynamic structures called B-spirals that further aggregate 
by hydrophobic association to form twisted filaments (Urry, 1991; Urry, et al., 1994). 
Inverse temperature transition offers several advantages. Expense associated with 
chromatographic resins and equipment are eliminated. It also facilitates scale up of 

5 purification from grams to kilograms. Milder purification conditions use only a modest 
change in temperature and ionic strength. This also facilitates higher recovery, faster 
purification and high volume processing. Protein purification is generally the slow step 
(bottleneck) in pharmaceutical product development. Through exploitation of this reversible 
inverse temperature transition property, simple and inexpensive extraction and purification 

10 is performed. The temperature at which the aggregation takes place can be manipulated by 
engineering biopolymers containing varying numbers of repeats and changing salt 
concentration in solution (McPherson et al., 1996). Chloroplast mediated expression of 
insulin-polymer fusion protein eliminates the need for the expensive fermentation process 
as well as reagents needed for recombinant protein purification and downstream processing. 

15 

Large-scale production of insulin in plants in conjunction with an oral delivery system 
is a powerful approach to provide insulin to diabetes patients at an affordable cost and 
provide tobacco farmers alternate uses for this hazardous crop. For example, Sun et al. 
(1994) showed that feeding a small dose of antigens conjugated to the receptor binding non- 
20 toxic B subunit moiety of the cholera toxin (CTB) suppressed systemic T cell-mediated 
inflammatory reactions in animals. Oral administration of a myelin antigen conjugated to 
CTB has been shown to protect animals against encephalomyelitis, even when given after 
disease induction (Sun et al. 1996). Bergerot et al. (1997) reported that feeding small 
amounts of human insulin conjugated to CTB suppressed beta cell destruction and clinical 
25 diabetes in adult non-obese diabetic (NOD) mice. The protective effect could be transferred 
by T cells from CTB-insulin treated animals and was associated with reduced insulitis. 
These results demonstrate that protection against autoimmune diabetes can indeed be 
achieved by feeding small amounts of pancreas islet cell auto antigen linked to CTB 
(Bergerot, et al. 1997). Conjugation with CTB facilitates antigen delivery and presentation 
30 to the Gut Associated Lymphoid Tissues (GALT) due to its affinity for the cell surface 
receptor GM-ganglioside located on GALT cells, for increased uptake and immunologic 
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recognition (Arakawa et al. 1998). Transgenic potato tubers expressed up to 0.1% CTB- 
insulin fusion protein of total soluble protein, which retained GM-ganglioside binding 
affinity and native autogenicty for both CTB and insulin. NOD mice fed with transgenic 
potato tubers containing microgram quantities of CTB-insulin fusion protein showed a 
substantial reduction in insulitis and a delay in the progression of diabetes (Arkawa et al., . 
1998). However, for commercial exploitation, the levels of expression need to be increased 
in transgenic plants. Therefore, we undertook the expression of CTB-insulin fiision in 
transgenic chloroplasts of nicotine free edible tobacco to increase levels of expression 
adequate for animal testing. 

In accordance with one advantageous feature of this invention, we use poly(GVGVP) 
as a fusion protein to enable hyper-expression of insulin and accomplish rapid one step 
purification of fusion peptides utilizing the inverse temperature transition properties of this 
polymer. In another advantageous feature of this invention, we develop insulin-CTB fusion 
protein for oral delivery in nicotine free edible tobacco (LAMD 605), Both features are 
accomplished as follows: 

a) Develop recombinant DNA vectors for enhanced expression of Proinsulin as fusion 
proteins with GVGVP or CTB via chloroplast genomes of tobacco, 

b) Obtain transgenic tobacco (Petit Havana & LAMD 605) plants, 

c) Characterize transgenic expression of proinsulin polymer or CTB fusion proteins 
using molecular and biochemical methods in chloroplasts, 

d) Employ existing or modified methods of polymer purification from transgenic leaves^ 

e) Analyze Mendelian or maternal inheritance of transgenic plants, 

f) Large scale purification of insulin and comparison of current insulin purification 
methods with polymer-based purification method in E. coli and tobacco, 

g) Compare natural refolding chloroplasts with in vitro processing, 

h) Characterization (yield and purity) of proinsulin produced in E. coli and transgenic 
tobacco, and 

i) Assessment of diabetic symptoms in mice fed with edible tobacco expressing CTB- 
insulin fusion protein. 

Diabetes and Insulin: Insulin lowers blood glucose (OaHy et al. 1973). This is a result of 
its immediate effect in increasing glucose uptake in tissues. In muscle, under die action of 
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insulin, glucose is more readily taken up and either converted to glycogen and lactic acid or 
oxidized to carbon dioxide. Insulin also affects a number of important enzymes concerned 
with cellular metabolism. It increases the activity of glucokinase, which phosphoryiates 
glucose, thereby increasing the rate of glucose metabolism in the liver. Insulin also 
suppresses gluconeogenesis by depressing the function of liver enzymes, which operate the 
reverse pathway from proteins to glucose. Lack of insulin can restrict the transport of 
glucose into muscle and adipose tissue. This results in increases in blood glucose levels 
(hyperglycemia). In addition, the breakdown of natural fat to free fatty acids and glycerol 
is increased and there is a rise in the fatty acid content in the blood. Increased catabolism of 
fatty acids by the liver results in greater production of ketone bodies. They diffuse from the 
liver and pass to the muscles for further oxidation. Soon, ketone body production rate 
exceeds oxidation rate and ketosis results. Fewer amino acids are taken up by the tissues and 
protein degradation results. At the same time, gluconeogenesis is stimulated and protein is 
used to produce glucose. Obviously, lack of insulin has serious consequences. 

Diabetes is classified into types I and II. Type I is also known as insulin dependent 
diabetes mellitus (TDDM). Usually this is causedby a cell-mediated autoimmune destruction 
of the pancreatic 0-cells (Davidson, 1998). Those suffering from this type are dependent on 
external sources of insulin. Type II is known as noninsulin-dependent diabetes mellitus 
(NTDDM). This usually involved resistance to insulin in combination with its 
underproduction. These prominent diseases have led to extensive research into microbial 
production of recombinant human insulin (rHT). 

Expression of Recombinant Human Insulin in E. coli: In 1978, two thousand kilograms 
of insulin were used in the world each year; half of this was used in the United States 
(Steiner et al., 1978). At that time, the number of diabetics in the US were increasing 6% 
every year (Gunby, 1978). In 1997 - 98, 10% increase in sales of diabetes care products and 
19% increase in insulin products have been reported by Novo Nordisk (world's leading 
supplier of insulin), making it a 7.8 billion dollar industry. Annually, 160,000 Americans 
are killed by diabetes, making it the fourth leading cause of death. Many methods of 
production of rffl have been developed. Insulin genes were first chemically synthesized for 
expression in Esherichia coli (Crea et al., 1978). These genes encoded separate insulin A 
and B chains. The genes were each expressed in E. coli as fusion proteins with the 0- 



WO 01/72959 ~ ~~ "PCT/US01/06288 

10 



galactosidase (Goeddel et aL, 1979). The first documented production of rHI using this 
system was reported by David Goeddel from Genentech (Hall, 1988). Forreasons explained 
later, the genes were fused to the Tip synthase gene. This fusion protein was approved for 
commercial production by Eli Lilly in 1982 (Chance and Frank, 1993) with a product name 
5 of Humulin. As of 1986, Humulin was produced from proinsulin genes, Proinsulin contains 
both insulin chains and the C-peptide that connects them. Data concerning commercial 
production of Humulin and other insulin products is now considered proprietary information 
and is not available to the public. 

Delivery of Human Insulin: Insulin has been delivered intravenously in the past several 

10 years. However, more recently, alternate methods such as nasal spray are also available. 
Oral delivery of insulin is yet another new approach (Mathiowitz et aL, 1997). Engineered 
polymer microspheres made of biologically erodable polymers, which display strong 
interactions with gastrointestinal mucus and cellular linings, can traverse both mucosal 
absorptive epithelium and the follicle-associated epithelium, covering the lymphoid tissue 

15 of Peyer's patches. Polymers maintain contact with intestinal epithelium for extended 
periods of time and actually penetrate through and between cells. Animals fed with the 
poly(FA: PLGA)-encapsulated insulin preparation were able to regulate the glucose load 
better than controls, confirming that insulin crossed the intestinal barrier and was released 
from the microspheres in a biologically active form (Mathiowitz et aL, 1997). 

20 Protein Based Polymers (PBP): The synthetic gene that codes for a bioelastic PBP was 
designed after repeated amino acid sequences GVGVP, observed in all sequenced 
mammalian elastin proteins (Yeh et al. 1987), Elastin is one of the strongest known natural 
fibers and is present in skin, ligaments, and arterial walls. Bioelastic PBPs containing 
multiple repeats of this pentamer have remarkable elastic properties, enabling several 

25 medical and non-medical applications (Uny et al. 1993, Urry 1995, Daniell 1995). GVGVP 
polymers prevent adhesions following surgery, aid in reconstructing tissues and delivering 
drugs to the body over an extended period of time. North American Science Associates, Inc. 
reported that GVGVP polymer is non-toxic in mice, non-sensitizing and non-antigenic in 
guinea pigs, andnon-pyrogenic in rabbits (Urryetal. 1993)* Researchers have also observed 

30 that inserting sheets of GVGVP at the sites of contaminated wounds in rats reduces the 
number of adhesions that form as the wounds heal (Uny et al. 1993). In a similar manner, 
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using the GVGVP to encase muscles that are cut during eye surgery in rabbits prevents 
scarring following the operation (Urry et al. 1993, Urry 1995). Other medical applications 
of bioelastic PBPs include tissue reconstruction (synthetic ligaments and arteries, bones), 
wound coverings, artificial pericardia, catheters and programmed drug delivery (Urry, 1995; 
Urryetal., 1993, 1996). 

We have expressed the elastic PBP (GVGVP) 121 in E. coli (Guda et al. 1995, Brixey 
et al. 1 997), in the fungus Aspergillus nidulans (Herzog et al. 1 997), in cultured tobacco cells 
(Zhang et al. 1995), and in transgenic tobacco plants (Zhang et al. 1996). In particular, 
(GVGVP) 121 has been expressed to such, high levels in E. coli that polymer inclusion bodies 
occupied up to about 90% of the cell volume. Also, inclusion bodies have been observed in 
chloroplasts of transgenic tobacco plants (see attached article, Daniell and Guda, 1997). 
Recently, we reported stable transformation of the tobacco chloroplasts by integration and 
expression the biopolymer gene (EG121), into the Large Single Copy region (5,000 copies 
per cell) or the Inverted Repeat region (10,000 copies per cell) of the chloroplast genome 
(Guda et al., 1999). 

PBP as Fusion Proteins: Several systems are now available to simplify protein purification 
including the maltose binding protein (Marina et al. 1988), glutethione S-tranferase (Smith 
and Johnson 1988), biotinylated (Tsao et al. 1996), tbioredoxin (Smith et al. 1998) and 
cellulose binding (Ong et al. 1989) proteins. Recombinant DNA vectors for fusion with 
short peptides are now available to effectively utilize aforementioned fusion proteins in the 
purification process (Smith etal. 1998; Kim and Raines, 1993;Suetal. 1992). Recombinant 
proteins are generally purified by affinity chromatography, using ligands specific to carrier 
proteins (Nilsson et al. 1997). While these are useful techniques for laboratory scale 
purification, affinity chromatography for large-scale purification is time consuming and cost 
prohibitive. Therefore, economical and non-chromatographic techniques are highly 
desirable. In addition, a common solution to N-terminal degradation of small peptides is to 
fuse foreign peptides to endogenous E. coli proteins. Early in the development of this 
technique, p-galactosidase (p-gal) was used as a fusion protein (Goldberg and Goff, 1986). 
A drawback of this method was that the p-gal protein is of relatively high molecular weight 
(MW 100,000). Therefore, the proportion of the peptide product in the total protein is low. 
Another problem associated with the large p-gal fusion is early termination of translation 
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(Bumette, 1983; Hall, 1988). This occurred when (3-gal was used to produce human insulin 
peptides because the fusion was detached from the ribosome during translation thus yielding 
incomplete peptides. . Other proteins of lower molecular weight proteins have been used as 
fusion proteins to increase the peptide production. For example, better yields were obtained 
with the tryptophan synthase (190aa) fusion proteins (Hall, 1988; Burnett, 1983), 

Accordingly, one achievement according to this invention is to use poly(GVGVP) as 
a fusion protein to enable hyper-expression of insulin and accomplish rapid one step 
purification of the fusion peptide. At lower temperatures the polymers exist as more 
extended molecules which, on raising the temperature above the transition range, 
hydrophobically fold into dynamic structures called p-spirals that further aggregate by 
hydrophobic association to form twisted filaments (Urry, 1991). Through exploitation of this 
reversible property, simple and inexpensive extraction and purification is performed. The 
temperature at which aggregation takes place (TO is manipulatedby engineering biopolymers 
containing varying numbers of repeats or changing salt concentration (McPherson et al. 5 
1996). Another group has recently demonstrated purification of recombinant proteins by 
fusion with thermally responsive polypeptides (Meyer and Chilkoti, 1999). Polymers of 
different sizes have been synthesized and expressed in E, coli. This approach also eliminates 
the need for expensive reagents, equipment and time required for purification. 
Cholera Toxin p subunit as a fusion protein: Vibrio cholerae causes diarrhea by 
colonizing the small intestine and producing enterotoxins, of which the cholera toxin (CT) 
is considered the main cause of toxicity. CT is a hexameric AB 5 protein having one 27KDa 
A subunit which has toxic ADP-ribosyl transferase activity and a non-toxic pentamer of 1 1 .6 
kDa B subunits that are non-covalently linked into a very stable doughnut like structure into 
which the toxic active (A) subunit is inserted. The A subunit of CT consists of two 
fragments -Al and A2 which are linked by a disulfide bond. The enzymatic activity of CT 
is located solelyon the Alfragment (Gill, 1976). The A2 fragment of the A subunit links the 
Al fragment and the B pentamer. CT binds via specific interactions of the B subunit 
pentamer with GM1 ganglioside, the membrane receptor, present on the intestinal epithelial 
cell surface of the host. The A subunit is then translocated into the cell where it ADP- 
ribosylates the Gs subunit of adenylate cyclase bringing about the increased levels of cyclic 
AMP in affected cells that is associated with the electrolyte and fluid loss of clinical cholera 
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(Lebens et al. 1994). For optimal enzymatic activity, the Al fragment needs to be separated 
from the A2 fragment by proteolytic cleavage of the main chain and by reduction of the 
disulfide bond linking them (Mekalanos et ah, 1979). 

The Expression and assembly of CTB in transgenic potato tubers has been reported 
5 (Arakawa et al. 1997). The CTB gene including the leader peptide was fused to an 
endoplasmic reticulum retention signal (SEKDEL) at the 3 f end to sequester the CTB protein 
within the lumen of the ER. The DNA fragment encoding the 21 -amino acid leader peptide 
of the CTB protein was retained to direct the newly synthesized CTB protein into the lumen 
of the ER. Immunoblot analysis indicated that the plant derived CTB protein was 
10 antigenicaily indistinguishable from the bacterial CTB protein and that oligomeric CTB . 
molecules (Mr ~ 50 kDa) were the dominant molecular species isolated from transgenic 
potato leaf and tuber tissues. Similar to bacterial CTB, plant derived CTB dissociated into 
monomers (Mr- 15 kDa) during heat acid treatment. 

1 5 Enzyme linked immunosorbent assay methods indicated that plant synthesized CTB 

protein bound specifically to GM1 gangliosides, the natural membrane receptors of Cholera 
Toxin. The maximum amount of CTB protein detected in auxin induced transgenic potato 
leaf and tuber issues was approximately 0.3% of the total soluble protein. The oral 
immunization of CD-I mice with transgenic potato tissues transformed with the CTB gene 

20 (administered at weekly intervals for a month with a final booster feeding on day 65) has 
also been reported. The levels of serum and mucosal anti-cholera toxin antibodies in mice 
were found to generate protective immunity against the cytopathic effects of CT holotoxin. 

Following intraileal injection with CT, the plant immunized mice showed up to a 60% 
25 reduction in diarrheal fluid accumulation in the small intestine. Systemic and mucosal CTB- 
specific antibody titers were determined in both serum and feces collected from immunized 
mice by the class-specific chemiluminescent ELISA method and the endpoint titers for the 
three antibody isotypes (IgM, IgG and IgA) were determined. 

The extent of CT neutralization in both Vero cell and ileal loop experiments suggested 
30 that anti-CTB antibodies prevent CT binding to cellular GM1 -gangliosides. Also, mice fed 
with 3 g of transgenic potato exhibited similar intestinal protection as mice gavaged with 30 
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g of bacterial CTB. Recombinant LTB [rLTB] (the heat labile enterotoxin produced by 
Enterotoxigenic E. coli) which is structurally, functionally and immunologically similar to 
CTB was expressed in transgenic tobacco (Arntzen et al. 1998;HaqetaL, 1995). They have 
reported that the rLTB retained its antigenicity as shown by immunoprecipitation of rLTB 
with antibodies raised to rLTB from E. coll The rLTB protein was of the right molecular 
weight and aggregated to form the pentamer as confirmed by gel permeation 
chromatography. 

CTB has also been demonstrated to be an effective carrier molecule for induction of 
mucosal immunity to polypeptides to which it is chemically or genetically conjugated 
(McKenzie et al, 1984; Dertzbaugh et al, 1993). The production of immunomodulatory 
transmucosai carrier molecules, such as CTB, in plants may greatly improve the efficacy of 
edible plant vaccines (Haq et al, 1995; Thanavala et al, 1995; Mason et al, 1996) and may 
also provide novel oral tolerance agents for prevention of such autoimmune diseases as Type 
1 diabetes (Zhang et al, 1991), Rheumatoid arthritis (Trentham et al, 1993), multiple 
sclerosis (Khoury et al, 1990; Miller et al, 1992; Weiner et al 5 1993) as well as the 
prevention of allergic and allograft rejection reactions (Savegh et al, 1992; Hancock et al, . 
1993). Therefore, expressing a CTB-proinsulin fusion is an ideal approach for oral delivery 
of insulin. 

Chloroplast Genetic Engineering: Several environmental problems related to plant genetic 
engineering now prohibit advancement of this technology and prevent realization of its full 
potential. One such common concern is the demonstrated escape of foreign genes through 
pollen dispersal from transgenic crop plants to their weedy relatives creating super weeds or 
causing gene pollution among other crops or toxicity of transgenic pollen to non-target 
insects such as butterflies. The high rates of gene flow from crops to wild relatives (as high 
as 38% in sunflower and 50% in strawberries) are certainly a serious concern. Clearly, 
maternal inheritance (lack of chloroplast DNA in pollen) of the herbicide resistance gene via 
chloroplast genetic engineering has been shown to be a practical solution to these problems 
(Daniell et al, 1998). Another common concern is the sub-optimal production of Bacillus 
thuringiensis (B.t.) insecticidal protein or reliance on a single (or similar) B.t protein in 
commercial transgenic crops resulting in B.t. resistance among target pests. Clearly, 
different insecticidal proteins should be produced in lethal quantities to decrease the 
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development of resistance. Such hyper-expression of a novel B .t. protein in chloroplasts has 
resulted in 100% mortality of insects that are up to 40,000- fold resistant to other B.t. 
proteins (Kota et al. 1999). Therefore, chloroplast genome is an attractive target for 
expression of foreign genes due to its ability to express extraordinarily high levels of foreign 
5 proteins and efficient containment of foreign genes through maternal inheritance. 

When we developed the concept of chloroplast genetic engineering (Daniell and 
McFadden, 1988 U.S. Patents; Daniell, World Patent, 1999). It was possible to introduce 
isolated intact chloroplasts into protoplasts and regenerate transgenic plants (Carlson, 1 973). 
Therefore, early investigations on chloroplast transformation focused on the development 

10 of in organello systems using intact chloroplasts capable of efficient and prolonged 
transcription and translation (Daniell and Rebeiz, 1982; Daniell et al., 1983, 1986) and 
expression of foreign genes in isolated chloroplasts (Daniell and McFadden, 1987). 
However, after the discovery of the gene gun as a transformation device (Daniell, 1993), it 
was possible to transform plant chloroplasts without the use of isolated plastids and 

15 protoplasts. Chloroplast genetic engineering was accomplished in several phases. Transient 
expression of foreign genes in plastids of dicots (Daniell et al., 1990; Ye et al., 1990) was 
followed by such studies in monocots (Daniell et al., 1991). Unique to the chloroplast 
genetic engineering is the development of a foreign gene expression system using 
autonomously replicating chloroplast expression vectors (Daniell et al., 1990). Stable 

20 integration of a selectable marker gene into the tobacco chloroplast genome (Svab and 
Maliga, 1 993) was also accomplished using the gene gun. However, useful genes conferring 
valuable traits via chloroplast genetic engineering have been demonstrated only recently. For 
example, plants resistant to B.t. sensitive insects were obtained by integrating the crylAc 
gene into the tobacco chloroplast genome (McBride et al., 1995). Plants resistant to B.t. 

25 resistant insects (up to 40,000 fold) were obtained by hyper-expression of the cryilA gene 
within the tobacco chloroplast genome (Kota et al., 1 999). Plants have also been genetically 
engineered via the chloroplast genome to confer herbicide resistance and the introduced 
foreign genes were maternally inherited, overcoming the problem of cut-cross with weeds 
(Daniell et al., 1998). Chloroplast genetic engineering has also been used to produce 

30 pharmaceutical products that are not used by plants (Guda et al., 2000). Chloroplast genetic 
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engineering technology is currently being applied to other useful crops (Sidorov et al. 1999; 
Daniell. 1999). 

Polymer-proinsulin Recombinant DNA Vectors: First we developed independent 
chloroplast vectors for the expression of insulin chains A and B as polymer fusion peptides, 

5 as it has been produced in E. coli for commercial purposes in the past. The disadvantage of 
this method is that E. coli does not form disulfide bridges in the cell unless the protein is 
targeted to the periplasm. Expensive in vitro assembly after purification is necessary for this 
approach. Therefore, a better approach is to express the human proinsulin as a polymer 
fusion protein. This method is better because chloroplasts are capable of forming disulfide 

10 bridges. Using a single gene, as opposed to the individual chains, eliminates the necessity 
of conducting two parallel vector construction processes, as is needed for individual chains. 
In addition, the need for individual fermentations and purification procedures is eliminated 
by the single gene method. Further, proinsulin products require less processing following 
extraction. Another benefit of using the proinsulin is that the C-peptide, which is an 

15 essential part the proinsulin protein, has recently been shown to play a positive role in 
diabetic patients (Ido et al, 1997). 

Recently, the human pre-proinsulin gene was obtained from Genentech, Inc. First, 
the pre-proinsulin was sub-cloned into pUC19 to facilitate further manipulations. The next 
step was to design primers to make chloroplast expression vectors. Since we are interested 

20 in proinsulin expression, the 5' primer was designed to land on the proinsulin sequence. This 
FW primer eluded the 69 bases or 23 coded amino acids of the leader or pre-sequence of 
preproinsuhn. Also, the forward primer included the enzymatic cleavage site for the protease 
factor Xa to avoid the use of cyanogen bromide. Beside the Xa-factor, a Smal site was 
introduced to facilitate subsequent subcloning. The order of the FW primer sequence is Smal 

25 - Xa-factor - Proinsulin gene. The reverse primer includes BamHl and Xbal sites, plus a 
short sequence with homolgy with the pUC 19 sequence following the proinsulin gene. The 
297bp PCR product (Xa Pris) includes three restriction sites, which are the Smal site at the 
5'-end and Xbal/BamHl sites at the 3' end of the proinsulin gene. The Xa-Pris was cloned 
into pCR2.1 resulting in pCR2.1 - Xa- Pris (4.2kb). Insertion of Xa-Pris into the multiple 

30 cloning site of pCR2.1, resulted in additional flanking restriction enzyme sites that will be 
used in subsequent sub-cloning steps. A GVGVP 50-mer was generated as described 
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previously (Daniell et aL 1997). The ribosome binding sequence was introduced by 
digesting pUCs-10, which contains the RBS sequence GAAGGAG, with Nool and Hind III 
flanking sites. The plasmid pUC19-50 was also digested with the same enzymes. The 
50mer gene was eluted from the gel and ligated to pUCs- 10 to produce pUCs-1 0-5 Omer. The 
5 ligation step inserted into the 50mer gene a RBS sequence and a Smal site outside the gene 
to facilitate subsequent fusion to proinsulin. 

Another Smal partial digestion was performed to eliminate the stop codon of the 
biopolymer, transform the 50mer to a 40mer, and fuse the 40mer to the Xa-proinsulin 
sequence. The conditions for this partial digestion needed a decrease in DNA concentration 

10 and the 1:15 dilution of Smal. Once the correct fragment was obtained by the partial 
digestion of Smal (eliminating the stop codon but include the RBS site), it was ligated to the 
Xa-proinsulin fusion gene resulting in the construct pCR2.1-40-XaPris. Finally, the 
biopolymer (40mer) - proinsulin fusion gene was subcloned into pSBL-CtV2 (chloroplast 
vector) by digesting both vectors with Xbal. Then the fusion gene was ligated to the pSBL- 

15 CtV2 and the final vector was called pSBL-OC-XaPris. The orientation of the insert was 
checked with Nool: one the five colonies chosen had the correct orientation of the gene. 
The fusion gene was also subcloned into pLD-CtV vector and the orientation was checked 
with EooRl and Pvuil. One of the four colonies had the correct orientation of the insert. 
This vector was called pLD-OC-XaPris (Fig.2A). 

20 Both chloroplast vectors contain the 1 6S rRNA promoter (Pnn) driving the selectable 

marker gene aadA (aminoglycoside adenyl transferase conferring resistance to 
spectinomycin) followed by the psbA 3 3 region (the terminator from a gene coding for 
photosystem II reaction center components) from the tobacco chloroplast genome. The only 
difference between these two chloroplast vectors (pSBL and pLD) is the origin of DNA 

25 fragments. Both pSBL and pLD are universal chloroplast expression/integration vectors and 
can be used to transform chloroplast genomes of several other plant species (Daniell et al. 
1998) because these flanking sequences are highly conserved among higher plants. The 
universal vector uses trnA and trnl genes (chloroplast transfer RNAs coding for Alanine and 
Isoleucine) from the inverted repeat region of the tobacco chloroplast genome as flanking 

30 sequences for homologous recombination as shown in Figs. 2A and 3B. Because the 
universal vector integrates foreign genes within the Inverted Repeat region of the chloroplast 
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genome, it should double the copy number of insulin genes (from 5000 to 10,000 copies per 
cell in tobacco). Furthermore, it has been demonstrated that homoplasmy is achieved even 
in the first round of selection in tobacco probably because of the presence of a chloroplast 
origin of replication within the flanking sequence in the universal vector (thereby providing 

5 more templates for integration). Because of these and several other reasons, foreign gene 
expression was shown to be much higher when the universal vector was used instead of the 
tobacco specific vector (Guda et aL, 2000). 

DNA sequence of the polymer-proinsulin fusion was determined to confirm the 
correct orientation of genes, in frame fusion and lack of stop codons in the recombinant 

10 DNA constructs. DNA sequencing was performed using a Perkin Elmer AB1 prism 373 
DNA sequencing system using a ABI Prism Dye Termination Cycle Sequencing Kit. The 
kit uses AmpliTaq DNA polymerase. Insertion sites at both ends were sequenced using 
primers for each strand. Expression of all chloroplast vectors was first tested in E. coli 
before their use in tobacco transformation because of the similarity of protein synthetic 

15 machinery (BriseyetaL 1997). Y or Escherichia coli expression XL- 1 Blue strain was used 
E. coli was transformed by standard CaCl 2 transformation procedures. 
Expression and Purification of the Biopolymer-proinsulin fusion protein: Terrific broth 
growth medium was inoculated with 40/zl of Ampicillin (lOOmg/ml) and 40/A of the XL-1 
Blue MRF To strain of E. coli containing pSBL-OC-XaPris plasmid. Similar inoculations 

20 were made for pLD-OC-XaPris and the negative controls, which included both plasmids 
containing the gene in the reverse orientation and the E. coli strain without any plasmid. 
Then, 24hr cultures were centrifuged at 13,000rpmfor3 min. The pellets were resuspended 
in 500//1 of autoclaved dH 2 0 and transferred to 6ml Falcon tubes. The resuspended pellet 
was sonicated, using a High Intensity Ultrasonic processor, for 15 sec at an amplitude of 40 

25 and then 15 sec on ice to extract the fusion protein from cells. This sonication cycle was 
repeated 15 times. The sonicated samples were transferred to microcentrifuge tubes and 
centrifuged at 4°C at 10,000g for 10 min to purify the fusion protein. After centrifugation, 
the supernatant were transferred to microcentrifudge tubes and an equal volume of 2XTN 
buffer(100mMTrisHCI,pH8, 1 00 mMNaCl) was added. Tubes were warmed at 42° C for 

30 25 min to induce biopolymer aggregation. Then the fusion protein was recovered by 
centrifuging at 2,500 rpm at 42 °C for 3 min. The recovered fusion protein was resuspended 
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in 100,ul of cold water. The purification process was repeated twice. Also, the fusion 
protein was recovered by using 6M Guanidine hydrochloride phosphate buffer, pH 7.0 
(instead of water), to facilitate stability of insulin. New cultures were incubated for this step 
following the same procedure as described above, except that the pSBL-OC-XaPris 
5 expressing cells were incubated for 24, 48 and 72 hrs,. Cultures were centrifiiged at 4,000 
rpm for 12 niin and the pellet was resuspended in 6M Guanidine hydrochloride phosphate 
buffer, pH 7.0, and then sonicated as described above. After sonication, samples were run 
in a 1 6;5% Tricine gel, transferred to -the nitrocellulose membrane, and immunoblotting was 
performed the following day, 

10 A 15% glycine gel was run for 6h at recommended voltage as shown in Fig. 1 . Two 

different methods of extraction were used. It was observed that when the sonic extract is in 
6M Guanicine Hydrochloride Phosphate Buffer, pH7.0, the molecular weight changes from 
its original and correct MW 24 kD to a higher MW of approximately 30 kDa (Fig. 1C. I). 
This is probably due to the conformation that the biopolymer takes under this kind of buffer, 

15 which is used to maximize the extraction of proinsulin. 

The gel was first stained with 0.3M CuCl 2 and then the same gel was stained with 
Commassie R-250 Staining Solution for an hour and then destained for 1 5 min first, and then 
overnight CuCl 2 creates a negative stain (Lee et al. 1987). Polymer proteins (without 
fusion) appear as clear bands against a blue background in color or dark against a light 

20 semiopaque background (Fig. 1 A). This stain was used because other protein stains such as 
Coomassie Blue R250 does not stain the polymer protein due to the lack of aromatic side 
chains (McPherson et al., 1992). Therefore, the observation of the 24 kDa protein in R250 
stained gel (Fig. IB) is due to the insulin fusion with the polymer. This observation was . 
further confirmed by probing these blots with the antihuman proinsulin antibody. As 

25 anticipated, the polymer insulin fusion protein was observed in western blots as shown in 
Fig. 1C, even though the binding of antibody was less efficient (probably due to concealment 
of insulin epitopes by the polymer). Larger proteins observed as shown in Fig. 1C II are 
tetramer and hexamer complexes of proinsulin. 

It is evident that the insulin-polyer fusion proteins are stable in E. coli. Confirming 

30 this observation, recently another lab has shown that the PBP polymer protein conjugates 
(with thioredoxin and tendamistat) undergo thermally reversible phase transition, retaining 
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the transition behavior of the free polymer (Meyer and Chikoti, 1999). These results clearly 
demonstrate that insulin fusion has not affected the inverse temperature transition property 
of the polymer. One of the concerns is the stability of insulin at temperatures used for 
thermally reversible purification. Temperature induced production of human insulin has 
5 been in commercial use (Schmidt et al. 1999). Also, the temperature transition can be 
lowered by increasing the ionic strength of the solution during purification of this PSP 
(McPherson et al, 1996). Thus, GVGVP-fusion could be used to purify a multitude of 
economically important proteins in a simple inexpensive step. 

Biopolymer-proinsulin fusion gene expression in chloroplast: As described in section d, 
10 pSBL-OC-R40XaPris vector and pLD-OC-R40XaPris vectors were bombarded into the 
tobacco chloroplasts genome via particle bombardment (DanielL, 1997). PCR was 
performed to confirm biopolymer-proinsulin fusion gene integration into chloroplast genome. 
The PCR products were examined in 0.8% agarose gels. Fig. 2A shows primers landing sites 
and expected PCR products. Fig. 2B shows the 1 .6 kbp PCRproduct, confirming integration 
15 of the aadA gene into the chloroplast genome. This 1.6kb product is seen in all clones 
except L9, which is a mutant. We used primers 2P and 2M to confirm integration of both 
the aadA and biopolymer-proinsulin fusion gene. The 13 kbp product corresponds to the 
native chloroplast fragment and the 3.5 kbp product corresponds to the chloroplast genome 
that has integrated all three genes as shown in Figs. 2C amd D. All the clones examined at 
20 this time show heteroplasmy, exce[t c;pmes :8d om Fog/ 2C, and S41b in Fig. 2D, which 
show almost homoplasmy. 

Protease Xa Digestion of the Biopolymer-proinsulin fusion protein and Purification of 
Proinsulin: Factor Xa was purchased from New England Biolabs at a concentration of 1 .0 
mg/ml. The Factor Xa is supplied in 20mM HEPES, SOOmM, NaCl, 2mM CaCl 2 , 50% 

25 glycerol, (pH 8.0). The reaction was carried out in a 1 : 1 ratio of fusion protein to reaction 
buffer. Thereaction buffer was made with20mMTris-HCI, lOOmMNaCl^mMCaC^fcH 
8.0). The enzymatic cleavage of the fusion protein to release the proinsulin protein from the 
(GWP) 40 was initiated by adding the protease to the purified fusion protein at a ratio (ww) 
of approximately 1,500. This digestion was continued for 5 days with mild stirring at 4°C. 

30 Cleavage of the fusion protein was monitored by SDS-PAGE analysis. After the cleavage, 
the same conditions are used for purification of the proinsulin protein. The purification steps 
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are the same as for the purification of the fusion protein, except that instead of recovering 
the pellet, the supernatant is saved. We detected cleaved proinsulin in the extracts isolated 
in 6M guanidine hydrochloride buffer as shown in Fig. 1C 1 1 . Conditions can be estimized 
for complete cleavage. The Xa protease has been successfully used to cleave (GVGVP) 20 - 
5 GST fusion (McPherson et al. 1 992). Therefore, cleavage of proinsulin from GVGVP using 
the Xa protease does not pose problems. 

Vector for CTB expression in chloroplasts: The leader sequence (63 bp) of the native CTB 
gene (372 bp) was deleted and a start codon (ATG) introduced at the 5' end of the remaining 
CTB gene (309 bp). Primers were designed to introduce a rbs site 5 bases upstream of the 

1 0 start codon. The 5 1 primer (38mer) was designed to and on the start codon and the 5-end of 
the CTB gene. This primer had an Xbal site at the 5 ! -end, the rbs site [GGAGG], a 5 bp 
breathing space followed by the first 20 bp of the CTB gene. The 3' primer (32mer) was 
designed to land on the 3* end of the CTB gene and it introduced restriction sites at the 3 r end 
to facilitate subcloning. The 347 bp rCTB PCR product was subcloned into pCR2.1 

15 resulting in pcCR2.1-rCTB. The final step was insertion of rCTB into the Xbal site of the 
universal or tobacco vector (pLB-CtV2) that allows the expression of the construct in E. coli 
and chloroplasts. Restriction en2yme digestion of the pLD-LH-rCTB vector with BamHl 
was performed to confirm the correct orientation of the inserted fragment in the vector. 

Because of the similarity of protein synthetic machinery, expression of the chloroplast 

20 vector was tested in E. coli before its use in tobacco transformation. For Escherichia coli 
expression the XL-1 Blue MRF TO strain was used. E. coli was transformed by standard 
CaCl 2 transformation procedures. Transformed E. coli (24 hrs culture and 48 hrs culture in 
100ml TB with lOOmg/ml ampicillin) and untransformed E. coli (24 hrs culture and 48 hrs 
culture in 100ml TB with 12.5mg/ml tetracycline) was then centrifiiged at 10000 x g in a 

25 Beckman GS- 1 5R centrifuge for 1 5 min. The pellet was washed with 200mM Tris-Cl twice 
and resuspended in 500/^1 extraction buffer (200rnM Tris-Cl, pH8.0, lOOmM NaCl; lOmM 
EDT A, 2mM PMSF) and then sonicated using the Autotune Series High Intensity Ultrasonic 
Processor- Then, 100/-4 aliquots of the sonicated transformed and untransformed cells 
[containing 50 - lOOyUg of crude protein extract as determined by Bradford protein assay 

30 (Bio-Rad Inc)] and purified CTB (Sigma C-9903) were boiled with 2X SDS sample buffer 
and separated on a 15% SDS-PAGE gel in Tris-glycine buffer (25mM Tris, 250 mM glycine, 
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pH8 .3 , 0 . 1 % SDS) . The separated protein was then transferred to a nitrocellulo se membrane 
by electro blotting using the Trans-Blot Electrophoretic Transfer Cell (Bio-Rad Inc.). 
Immunoblot detection ofCTB expression inE. colt Nonspecific antibody reactions were 
blocked by incuhation of the membrane in 25ml of 5% non-fat dry milk in TBS buffer for 
1 - 3 hrs on a rotary shaker (40 rpm), followed by washing in TBS buffer for 5 min. The 
membrane was then incubated for an hour with gentle agitation in 30 ml of a 1 :5000 dilution 
of rabbitanti-cholera antiserum (Sigma C-3062) in TBS with Tween-20 [TBST] (containing 
1% non-fat dry milk) followed by washing 3 times in TEST buffer. The membrane was 
incubated for an hour at room temperature with gentle agitation in 30 ml of a 1:10000 
dilution of mouse anti-rabbit lgG conjugated with alkaline phosphatase in TBST. It was then 
washed thrice with TBST and once with TBS followed by incubation in the Alkaline 
Phosphatase Color Development Reagents, BCIP/NBT in AP color development Buffer 
(Bio-Rad 5 Inc.) for an hour. Immunoblot analysis snows the presence of 11.5 kDa 
polypeptide for purified bacterial CTB and transformed 24h/ 48h cultures (Fig. 3 A, lanes 2, 
3 and 5). The 48h culture appears to express more CTB than that of the 24h culture 
indicating the accumulation of the CTB protein over time. The purified bacterial CTB (45 
Kda) dissociated into monomers (11.5 KDa each) due to boiling prior to SDS PAGE. These 
results indicate that the pLD-LH-CTB vector is expressed in E. colL Because of the 
similarity of the E. coli protein synthetic machinery to that of chloroplasts, chloroplast 
expression of the above vector should be possible. 

CTB expression in chloroplasts: As described below, pLD-LH-CTB was integrated into 
the tobacco chloroplast genome via particle bombardment (Daniell, 1997). PCR analysis 
was performed to confirm chloroplast integration. Fig. 3B shows primer landing sites and 
size of expected products. PGR analysis of clones obtained after the first round of selection 
was carried out as described below. PCR products were examined on 0.8% agarose gels. 
The PCR results (Fig, 3C) show that clones 1 and 5 that do not show any product are 
mutants while clones 2, 3, 4, 6, 7, 8, 9, 10 and 11 that gave a 1.65 kbp product are 
transgenic. As expected, lanes 13 - 15 did not give any PCR product, confirming that the 
PCR reaction was not contaminated. Because primers 3P & 3M land on the aadA gene and 
on the chloroplast genome, all clones that show PCR products have integrated the CTB gene 
and the selectable marker into the chloroplast genome. Clones that showed chloroplast 
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integration of the CTB gene were moved to the second round of selection to increase copy 
number. PCR analysis of clones obtained after the second round of selection was also 
carried out. PCR results shown in Fig. 3D indicate that clone 5 does not give a 3 kbp 
product indicating that it is a mutant as observed earlier. Other clones give a strong 3 kbp 
5 product and a faint L3 kbp (similar to the 1.3 kbp untransformed plant product) product, 
indicating that they are transgenic but not yet homoplasmic. Complete homoplasmy can be 
accomplished by several more rounds of selection or by germinating seeds from transgenic 
plants on 500 ,ug/ml of spectinomycin. 

CTB-Proinsulin Vector Construction: The chloroplast expression vector pLD-CTB-Proins 

10 was constructed as follows. First, both proinsulin and cholera toxin B-subunit genes were 
amplified from suitable DNA using primer sequences. Primer 1 contains the GGAGG 
chloroplast preferred ribosome binding site five nucleotides upstream of the start codon 
(ATG) for the CTB gene and a suitable restriction enzyme site (Spei) for insertion into the 
chloroplast vector. Primer 2 eliminates the stop codon and adds the first two amino acids 

15 of a flexible hinge tetrapeptide GPGP as reported by Bergerot et al. (1997), in order to 
facilitate folding of the CTB-proinsulin fusion protein. Primer 3 adds the remaining two 
amino acids for the hinge tetra-peptide and eliminates the pre-sequence of the pre-proinsulin. 
Primer 4 adds a suitable restriction site (Spel) for subcloning into the chloroplast vector. 
Amplified PCR products were inserted into the TA cloning vector. Both the CTB and 

20 proinsulin PCR fragments were excised at the Smal and Xbal restriction sites. Eluted 
fragments were ligated into the TA cloning vector. Interestingly, all white colonies showed 
the wrong orientation for CTB insert while three of the five blue colonies examined showed 
the right orientation of the CTB insert The CTB-proinsulin fragment was excised at the 
EcoRl sites and inserted into EcoRl digested dephosphorolated pLD vector. Resultant 

25 onicroplast integration expression vector, pLD-CTB-Proins will be tested for expression in 
E. colt by western blots. After confirmation of expression of CTB-proinsulin fusion in E. 
coli, pLD-CTB-Proins will be bombarded into tobacco cells as described below. 
Optimization of fusion gene expression: It has been reported that foreign genes are 
expressed between 5% (crylAQ cryllA) and 30% (uldA) in transgenic chloroplasts (Daniell, 

30 1 999). If the expression levels of the CTB-Proinsulin or polymer-proinsulin fusion proteins 
are low, several approaches will be used to enhance translation of these proteins. In 
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chloroplast, transcriptional regulation of gene expression is less important, although some 
modulations by light and developmental conditions are observed (Cohen and Mayfield, 
1997). RNA and protein stability appear to be less important because of observation of large 
accumulation of foreign proteins (e.g. GUS up to 30% of total protein) and Ipsl transcripts 
16,966-fold higher than the highly expressing nuclear transgenic plants. Chloroplast gene 
expression is regulated to a large extent at the post-transcriptional level. For example, 5' 
UTRs are used for optional translation of chloroplast mRNAs. Shine-Delgarno (GGAGG) 
sequences as well as a stem-loop structure located 5' adjacent to the SD sequence are used 
for efficient translation. A recent study has shown that insertion of the psbA 5 1 UTR 
downstream of the 16S rRNA promoter enhanced translation of a foreign gene (GUS) 
hundred-fold (Eibl et al. 1999). Therefore, the 85-bp tobacco chloroplast DNA fragment 
(1595 - 1680) containing 5' psbA UTR will be amplified using the following primers 
cctttaaaaagccttccattttctattt, gccatggtaaaatctfggtttatta. This PCR product will be inserted 
downstream of the 16S rRNA promoter to enhance translation of the proinsulin fusion 
proteins. 

Yet another approach for enhancement of translation is to optimize codon 
compositions of these fusion protein. Since both fusion proteins are expressed well in E. 
coli, we expected efficient expression in chloroplasts. However, optimizing codon 
compositions of proinsulin and CTB genes to march the psbA gene could further enhance 
the level of translation. Although rbcL (RuBisCO) is the most abundant protein on earth, 
it is not translated as frequently as the psbA gene due to the extremely high turnover of the 
psbA gene product. The psbA gene is under stronger selection for increased translation 
efficiency and is the most abundant thylakoid protein. In addition, codon usage in higher 
plant chloroplasts is biased towards the NNC codon of 2-fold degenerate groups (i.e. TTC 
over TTT, GAC over GAT, CAC over CAT, AAC over AAT, ATC over ATT, ATA etc.). 
This is in addition to a strong bias towards T at third position of 4-fold degenerate groups. 
There is also a context effect that should be taken into consideration while modifying 
specific codons. The 2-fold degenerate sites immediately upstream from a GNN codon do 
not show this bias towards NNC, (TTT GGA is preferred to TTC GGA while TTC CGT is 
preferred to TTT CGT TTC AGT to TTT AGT and TTC TCT to TTT TCT). In addition, 
highly expressed chloroplast genes use GNN more frequently than other genes. The web site 
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may be used optimize codon composition by comparing different 
species. Abundance of amino acids in chloroplasts can be taken into consideration 
(pathways compartmentalized in plastids as opposed to those that are imported into plastids). 

As far as the biopolymer gene is concerned, we observed incomplete translation 
products in plastids when we expressed the 120mer gene (Guda et al. 2000). Therefore, 
while expressing the polymer-proinsulin fusion protein, we decreased the length of the 
polymer protein to 40mer, without losing the thermal responsive property. In addition, 
optimal codons for glycine (GGT) and valine (GTA), which constitute 80% of the total 
amino acids of the polymer, have been used. In all nuclear encoded genes glycine make up 
147/1 000 amino acids while in tobacco chloroplasts it is 129/1 000. Highly expressing genes 
like psbA and rbcL of tobacco make up 192 and 190gly/1000. Therefore, glycine may not 
be a limiting factor. Nuclear genes use 52/1000 proline as opposed to 42/1000 in 
chloroplasts. However, currently used codon for proline (CCG) can be modified to CCA or 
CCT to further enhance translation. It is known that pathways for proline and valine are 
compartmentalized in chloroplasts (Guda et al. 2000). Also, proline is known to accumulate 
in chloroplasts as an osmoprotectant (Daniell et al. 1994). 

Bombardment and Regeneration of Chloroplast Transgenic Plants: Tobacco (Nicotiana 
tabacurn var. Petit Havana) and nicotine free edible tobacco (LAMD 665, gift from Dr. 
Keith Wycoff. Planet Biotechnology) plants are grown aseptically by germination of seeds 
on MSO medium. THis medium contains MS salts (4.3 g/liter), B5 vitamin mixture (myo- 
inositol, 100mg/liter;thiamine-HCl. lOmg/liter nicotinic acid. 1 mg/liter; pyridoxine-HCL. 
1 mg/liter), sucrose (30 g/liter) and phytagar (6 g/liter) at pH 5.8. Fully expanded, dark 
green leaves of about two month old plants are used for bombardment. 

Leaves are placed abaxial side up on a Whatman No. 1 filter paper laying on the 
RMOP medium (Daniell, 1993) in standard petri plates (100x15 mm) for bombardment. 
Tungsten (1 ^m) or Gold (0.6 //m) microprojectiles are coated with plasmid DNA 
(chloroplast vectors) and bombardments carried out with the biolistic device PDSIOOO/He 
(Bio-Rad) as described by Daniell (1997). Following bombardment, petri plates are sealed 
with parafilm and incubated at 24°C under 12 h photoperiod. Two days after bombardment, 
leaves are chopped into small pieces of -5 mm 2 in size and placed on the selection medium 
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(RMOP containing 500 jug/ml of spectinomycin dihydrochloride) with abaxial side touching 
the medium in deep (100x25 nun) petri plates (-10 pieces per plate). The regenerated 
spectinomycin resistant shoots are chopped into small pieces (-2mm 2 ) and subcloned into 
fresh deep petri plates (~5 pieces per plate) containing the same selection medium. Resistant 
5 shoots from the second culture cycle arbe transferred to the rooting medium (MSO medium 
supplemented with IB A. 1 mg/liter and spectinomycin dihydrochloride, 500 mg/liter). 
Rooted plants are transferred to soil and grown at 26°C under continuous lighting conditions 
for further analysis. 

Polymerase Chain Reaction: PCR is performed using DNA solated from control and 

10 transgenic plants to distinguish a) true chloroplast transforrnants from mutants and b) 
chloroplast transforrnants from nuclear transforrnants. Primers for testing the presence of 
the aadA gene (that confers spectinomycin resistance) in transgenic pants are landed on the 
aadA coding sequence and 16S rRNA gene (primers 1P&1M.). To test chloroplast 
integration of the insulin gene, one primer lands on the aadA gene, while another lands on 

15 the native chloroplast genome (primers 3P&3M) as shown in Figs. 2A and 3B. No PCR 
product is obtained with nuclear transgenic plants using this set of primers. The primer set 
(2P & 2M, in Figs. 2A and 3B) is used to test integration of the entire gene cassette without 
internal deletion or looping out during homologous recombination. A similar strategy has 
been used successfully to confirm chloroplast integration of foreign genes (Daniell et al, 

20 1998; Kota et al, 1999; Guda et aL, 1999). This screening is essential to eliminate mutants 
and nuclear transforrnants. 

Total DNA from unbombarded and transgenic plants is isolated as described by 
Edwards et al., (1991) to conduct PCR analyses in transgenic plants. PCR reactions are 
performed in a total volume of 50 jjA containing approximately 10 ng of template DNA and 

25 1 juM of each primer in a mixture of 300 ,uM of each deoxynucleotide (dNTPs), 200 mM 
Tris (pH 8.8), 100 mM KC1, 100 mM (NH A ) 2 S0 4i 20 mM MgS0 4 , 1% Triton X-100, 1 
mg/ml nuclease-free BSA and 1 or 2 units of Taq Plus polymerase (Stratagene, La Jolla, 
CA). PCR is carried out in the Perkin Elmer's GeneAmp PCR system 2400, by subjecting 
the samples to 94°C for 5 min and 30 cycles of 94°C for 1 min, 55°C for 1 .5 min, 72°C for 

30 1.5 or 2 min followed by a 72°C step for 7 min. PCR products are analyzed by 
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electrophoresis on 0.8% agarose gels. Chloroplast transgenic plants containing the 
proinsulin gene are then moved to second round of selection to achieve homoplasmy. 
Southern Blot Analysis: Southern blots are performed to determine the copy number of the 
introduced foreign gene per cell as well as to test homoplasmy. There are several thousand 
copies of the chloroplast genome present in each plant cell. Therefore, when foreign genes 
are inserted into the chloroplast genome, it is possible that some of the chloroplast genomes 
have foreign genes integrated while others remain as the wild type (heteroplasmy). 
Therefore, to ensure that only the transformed genome exists in cells of transgenic plants 
(homoplasmy), the selection process is continued. To confirm that the wild type genome 
does not exist at the end of the selection cycle, total DNA from transgenic plants should be 
probed with the chloroplast border (flanking) sequences (the trnl-trnA fragment, Figs. 2A 
and 3B). If wild type genomes are present (heteroplasmy), the native fragment size is 
observed along with transformed genomes. Presence of a large fragment (due to insertion 
of foreign genes within the flanking sequences) and absence of the native small fragment 
confirms homoplasmy (Daniell et al., 1998; Kota et aL, 1999; Guda et al., 1999). 

The copy number of the integrated gene is determined by establishing homoplasmy 
form the transgenic chloroplast genome. Tobacco chloroplasls contain 5000-10,000 copies 
of their genome per cell (Daniell et al., 1998). If only a fraction of the genomes are actually 
transformed, the copy number, by default, must be less than 10,000. By establishing that in 
the trangenics the insulin inserted transformed genome is the only one present, one can 
establish that the copy number is 5000-10,000 per cell. This is usually achieved by 
digesting the total DNA with a suitable restriction enzyme and probing with the flanking 
sequences that enable homologous recombination into the chloroplast genome. The native 
fragment present in the control should be absent in the transgenics. The absence of native 
fragment proves that only the transgenic chloroplast genome is present in the cell and there 
is no native, untransformed, chloroplast genome, without the insulin gene present. This 
establishes the homoplasmic nature of the transformants, simultaneously, thereby providing 
an estimate of 5000-10,000 copies of the foreign genes per cell. 

Total DNA is extracted from leaves of transformed and wild type plants using the 
CTAB procedure outlined by Rogers and Bendich (1988). Total DNA is digested with 
suitable restriction enzymes, electrophoresed on 0.7% agarose gels and transferred to nylon 
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membranes (Micron Separation Inc., Westboro, MA). Probes are labeled with 32 P-dCTP 
using the random-primed procedure (Promega). Pre-hybridization and hybridization steps 
are carried out at 42°C for 2 h and 16 h, respectively. Blots are soaked in a solution 
containing 2X SSC and 0.5% SDS for 5 inin followed by transfer to 2X SSC and 0. 1% SDS 
5 solution for 15 min at room temperature. Then, blots are incubated in hybridization bottles 
containing 0. IX SSC and 0.5% SDS solution for 30 min at 37°C followed by another step 
at 68°C for 30 min, with gentle agitation. Finally, blots are briefly rinsed in 0.1X SSC 
solution, dried and exposed to X-ray film in the dark. 

Northern Blot Analysis : Northern blots are performed to test the efficiency of transcription 
10 of the proinsulin gene fused with CTB or polymer genes. Total RNA is isolated from 150 
mg of frozen leaves by using the "Rneasy Plant Total RNA Isolation Kit" (Qiagen Inc., 
Chatsworth, CA). RNA (10-40 mg) is denatured by formaldehyde treatment, separated on 
a 1 .2% agarose gel in the presence of formaldehyde and transferred to a nitrocellulose 
membrane (MSI) as described in Sambrook et al. (1989). Probe DNA (proinsulin gene 
15 coding region) is labeled by the random-primed method (Promega) with 32 P-dCT isotope. 
The blot is pre-hybridized, hybridized and washed as described above for southern blot 
analysis. Transcript levels are quantified by the Molecular Analyst Program using the GS- 
700 Imaging Densitometer (Bio-Rad, Hercules, CA). 

Polymer-insulin fusion protein purification, quantitation and characterization: Because 
20 polymer insulin fusion proteins exhibit inverse temperature transition properties as shown 
in Figs. 1 A and B, they are purified from transgenic plants essentially following the same 
method for polymer purification from transgenic tobacco plants (Zhang et al., 1996). 
However, an additional step is introduced to take advantage of the compartmentalization of 
insulin polymer fusion protein within chloroplasts. Chloroplasts are first isolated from crude 
25 homogenate of leaves by a simple centrifugation step at ISOOXg. This eliminates most of 
the cellular organelles and proteins (Daniell at al., 1983, 1986). Then, chloroplasts are burst 
open by resuspending them in a hypotonic buffer (osmotic shock). This is a significant 
advantage because there are fewer soluble proteins inside chloroplasts when compared to 
hundreds of soluble proteins in the cytosol. Polymer extraction buffer contains 50 mM Tris- 
30 HC1, pH 7.5, 1% 2-mecaptoethanol, 5mM BDTA and 2mM PMSF and 0.8 M NaCl. The 
homogenate is then centrifuged at 10,000 g for 10 min (4°C), and the pellet discarded. The 
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supernatant is incubated at 42°C for 30 minutes and then centrifuged immediately for 3 
minutes at 5,000 g (room temperature). If insulin is found to be sensitive to this temperature, 
Ti is lowered by increasing salt concentration (McPherson et al. 5 1996). The pellet 
containing the insulin-polymer fusion protein is resuspended in the extraction buffer and 

5 incubated on ice for 1 0 minutes. The mixture is centrifuged at 12,000 g for 1 0 minute (4°C). 
The supernatant is then collected and stored at -20°C. The purified polymer insulin fasion- 
protein is electrophoresed in a SDS-PAGE gel according to Laemml (1970) and visualized 
by either staining with 0.3 M CuCI 2 (Lee et al., 1987) or transferred to nitrocellulose 
membrane and probed with antiserum raised against the polymer or insulin protein as 

10 described below. Quantification of purified polymer proteins may then be carried out by 
densitometry. 

After electrophoresis, proteins are transferred to a nitrocellulose membrane 
electrophoretically in 25 mM Tris, 192mM glycine, 5% methanol (pH 8.3). The filter is 
blocked with 2% dry milk in Tris-buffered saline for two hours at room temperature and 

15 stained with antiserum raised against the polymer AVGVP (kindly provided by the 
University of Alabama at Birmingham, monoclonal facility) overnight in 2% dry milk/Tris 
buffered saline. The protein bands reacting to the antibodies are visualized using alkaline 
phosphatase-linked secondary antibody and the substrates nitroblue tetrazolium and 5- 
bromo-4-chloro-3-indolyl-phosphate (Bio-Rad). Alternatively, for insulin-polymer fusion 

20 proteins, a Mouse anti-human proinsulin (IgGl) monoclonal antibody is used as a primary 
antibody. To detect the binding of the primary antibody to the recombinant proinsulin, a 
Goat anti-mouse IgG Horseradish Peroxidase Labeled monoclonal antibody (HPR) is used. 
The substrate used for conjugation withHPRisS^^S^^-Tetramethylbenzidine. Allproducts 
are available from American Qualex Antibodies, San Clemente ) CA. As a positive control, 

25 human recombinant proinsulin from Sigma may be used. This human recombinant 
proinsulin was expressed in E. coli by a synthetic proinsulin gene. Quantification of purified 
polymer fusion proteins is carried out by densitometry using Scanning Analysis software 
(BioSoft, Ferguson, MO) installed on a Macintosh LC m computer (Apple Computer, 
Cupertino, USA) with a 1 60-Mb hard disk operating on a System 7. 1, connected by SCSI 

30 interface to a Relisys RELI 2412 Scanner (Relisys, Milpitas, CA). Total protein contents is 
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then determined by the dye-binding assay using reagents supplied in kit fro Bio-Rad, with 
bovine serum albumin as a standard. 

Characterization of CTB expression: CTB protein levels in transgenic plants are 
determined using quantitative ELISA assays. A standard curve is generated using known 

5 concentrations of bacterial CTB. A 96-weli microtiter plate padded with 100 (aVwgII of 
bacterial CTB (concentrations in the range of 10 - 1000 ng) is incubated overnight at 4°C. 
The plate is washed thrice with PBST (phosphate buffered saline containing 0.05% Tween- 
20). The background is blocked by incubation in 1 % bovine serum albumin (BSA) in PBS 
(300 1/well) at 37°C for 2 h followed by washing 3 times with PBST. The plate is incubated 

10 in a 1 :8,000 dilution of rabbit anti-cholera toxin antibody (Sigma C-3062) (100 /^1/well) for 
2 h at 37°C, followed by washing the wells three times with PBST. The plate is incubated 
with a 1 : 80,000 dilution of anti-rabbit IgG conjugated with alkaline phoshatase (100 /^1/well) 
for 2 h at 37°C and washed thrice with PBST. Then, 1 00 fj\ alkaline phosphatase substrate 
(Sigma Fast p-nitrophenyl phosphate tablet in 5 ml of water is added and the reaction 

15 stopped with 1M NaOH (50 /^l/well) when absorbancies in the mid-range of the titration 
reach about 2.0, or after 1 hour, whichever comes first The plate is then read at 405nm, 
These results are used to generate a standard curve from which concentrations of plant 
protein can be extrapolated. Thus, total soluble plant protein (concentration previously 
determined using the Bradford assay) in bicarbonate buffer, pH 9.6 (15 nM Na2Co 3> 35mM 

20 NaHC0 3 ) is loaded at 100 plant /il/well and the same procedure as above can be repeated. 
The absorbance values are used to determine the ratio of CTB protein to total soluble plant 
protein, using the standard curve generated previously and the Bradford assay results. 
Inheritance of Introduced Foreign Genes: In initial tobacco transformants, some are 
allowed to self-pollinate, whereas others are used in reciprocal crosses with control tobacco 

25 (transgenics as female acceptors and pollen donors: testing for maternal inheritance). 
Harvested seeds (Tl) are germinated on media containing spectinomycin. Achievement of 
homoplasmy and mode of inheritance can be classified by looking at germination results. 
Homoplasmyis indicatedby totally green seedlings (Daniell etal., 1998) while heteroplasmy 
is displayed by variegated leaves (lack of pigmentation, Svab & Maliga, 1993). Lack of 

30 variation in chlorophyll pigmentation among progeny also underscores the absence of 
position effect, an artifact of nuclear transformation. Maternal inheritance may be 
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demonstrated by scie transmission of introduced genes via seed generated on transgenic 
plants, regardless of pollen source (green seedlings on selective media). When transgenic 
pollen is used for pollination of control plants, resultant progeny does not contain resistance 
to chemical in selective media (will appear bleached; Svab and Maliga, 1993). Molecular 
analyses confirms transmission and expression of introduced genes, and T2 seed is generated 
from those confirmed plants by the analyses described above. 

Comparison of Current Purification with Polymer-based Purification Methods: It is 

important to compare purification methods to test yield and purity, of insulin produced in E. 
coli and tobacco. One liter of pSBL containing bacteria is grown in LB/ampicillin (100 
Mg/ml) overnight and the fusion protein expressed. Cells are harvested by centrifugation at 
5000 X g for 10 min at 4°C, and the bacterial pellets resuspended in 5 ml/g (wet wt. 
Bacteria) of 100 mM Tris-HCl, pH 7.3. Lysozyme is added at a concentration of 1 mg/ml 
' and placed on a rotating shaker at room temperature for 15 min. The lysate is subjected to 
probe sonication for two cycles of 30 s on/30 s off at 4°C. Cellular debris is removed by 
centrifugation at 1000 X g for 5 min at 4°C. Insulin polymer fusion protein is purified by 
inverse temperature transition properties (Daniell et al., 1997). Alternatively, the fusion 
protein is purified according to Cowley and Mackin (1997). The supernatant is retained and 
centrifuged again at 27000 X g for 15 min at 4°C to pellet the inclusion bodies. The 
supernatant is discarded and the pellet resuspended in 1 ml/g (original wt. Bacteria) of dH 2 0, 
aliquoted into microcentrifuge tubes as 1 ml fractions, and then centrifuged at 1 6000 X gfor 
5 min at 4°C. The pellets are individually washed with 1 ml of 100 mM Tris-HCl, pH 8.5, 
1M urea, 1-1 Triton X-100 and again washed with 100 mM Tris HC1 pH8.5, 2 M urea, 2% 
Trinton X-100. The pellets are resuspended in 1 ml of dll,0 and transferred to a pre- 
weighted 30 ml Corex centrifuge tube. The sample is centrifuged at 1 5000 X g for 5 min at 
4°C, and the pellet resuspended in 10 ml/g (wet wt. pellet) of 70% formic acid. Cyanogen 
bromide is added to a final concentration of 400 mM and the sample incubated at room 
temperature in the dark for 16 h. The reaction is stopped by transferring the sample to a 
round bottom flask and removing the solvent by rotary evaporation at 50°C. The residue is 
resuspended in 20 ml/g (wet wt. pellet) of dH z O, shell frozen in a dry ice ethanol bath, and 
then lyophilized. The lyophilized protein is dissolved in 20 ml/g (wet wt. pellet) of 500 mM 
Tris-HCl, pH 8.2, 7 M urea. Oxidative sulfitolysis is performed by adding sodium sulfite 
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and sodium tetrathionate to final concentrations of 100 and 10 mM 5 respectively, and 
incubating at room temperature for 3 h. This reaction is then stopped by freezing on dry ice. 

Purification and folding of Human Proinsulin: The S-sulfonated material is applied to a 
2 ml bed of Sephadex G-25 equilibrated in 20 mM Tris-HCl, pH 8.2, 7 M urea, and then 
washed with 9 vols of 7 M urea. The collected fraction is then applied to a Pharmacia Mono 
Q HR 5/5 column equilibrated in 20 mM Tris-HCl, pH 8.2, 7 M urea at a flow rate of 1 
ml/min. A linear gradient leading to final concentration of 0.5 M NaCl is used to elute the 
bound material. 2 min (2 ml) fractions are collected during the gradient, and protein 
concentration in each fraction determined. Purity and molecular mass of fractions are 
estimated by Tricine SDS-PAGE (as shown in Fig. 2), where Tricine is used as the trailing 
ion to allow better resolution of peptides in the range of 1-1000 kDa. Appropriate fractions 
are pooled and applied to a L6 X 20 cm column of Sephadex G-25 (superfine) equilibrated 
in 5 mM ammonium acetate pH 6.8. The sample is collected based on UV absorbance and 
freeze-dried. The partially purified S-sulfonated material is resuspended in 50 mM 
glycine/NaOH, pH 10.5 at a final concentration of 2 mg/ml. P-mer-captoethanol is added ■ 
at a ratio of 1 .5 mol per mol of cysteine S-sulfonate and the sample stirred at 4°C in an open 
container for 16 h. The sample is then analyzed by reversed-phase high-performance liquid 
chromatography (RP-HPLC) using a Vydac C 4 column (2.2 X 150 mm) equilibrated in 4% 
acetonitrileand0.1%TFA. Adsorbed peptides are eluted with a linear gradient of increasing 
acetonitrite concentration (0.88% per min up to a maximum of 48%). The remaining 
refolded proinsulin are centrifuged at 16000 X g to remove insoluble material, and loaded 
onto a semi-preparative Vydad C 4 column (10 X 250 mm). The bound material is then 
eluted as described above, and the proinsulin collected and lyophilized. 
Analysis and characterization of insulin expressed in E. coli and Tobacco: The purified 
expressed proinsulin is subjected to matrix-assisted laser desorption/ionization-time of flight 
(MALDI-TCF) analysis (as described by Cowley andMackin, 1997), using proinsulin from 
Eli Lilly as both an internal and external standard. A proteolytic digestion is performed 
using Staphylococcus aureus protease V8 to determine if the disulfide bridges have formed 
correctly naturally inside chloroplasts or by in vitro processing. Five ^g of both the 
expressed proinsulin and Eli Lilly's proinsulin are lyophilized and resuspended in 50 /A of 
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250 mM NaP0 4 pH 7.8. Protease V8 is added at a ratio of 1:50 (w/w) in experimental 
samples and no enzyme added to the controls. All samples are then incubated overnight at 
37°C, the reactions stopped by freezing on dry ice, and samples stored at -20°C until 
analyzed. The samples are analyzed by RP-HPLC using a Vydac C 4 column (2.2 X 1 50 mm) 
5 equilibrated in 4% acetonitrite and 0. 1 % TFA. Bound material is then eluted using a linear 
gradient of increasing acetonitrile concentration (0.88% per min up to a maximum of 48%). 

CTB-GM1 ganglioside binding assay: A GM 1 -ELISA assay is performed as described by 
Arakawa et al. (1997) to determine the affinity of plant-derived CTB for GM 1 -ganglioside. 

10 The microtiter plate is coated with monosialogangliosice-GMl (Sigma G-7641) by 
incubating the plate with 100 Atl/well of GM1 (3.0 Mg/ml) in bicarbonate buffer, pH 9.6 at 
4°C overnight Alternatively, the wells are coated with 100 //l/well of BSA (3.0 jx^/wl) as 
control. The plates are incubated with transformed plant total soluble protein and bacterial 
CTB (Sigma C-9903) in PBS (100 ^l/well) overnight at 4°C. The remainder of the 

15 procedure is then identical to the ELISA described above. 

Mouse feeding assays for CTB: This is performed as described by Haq et al. (1995). 
B ALB/c mice, divided into groups of five animals each, are fasted overnight before feeding 
them transformed edible tobacco (that tastes like spinach) expressing CTB, untransformed 
edible tobacco and purified bacterial CTB. Feedings are performed at weekly intervals (0, 

20 7, 14 days) for three weeks. Animals are observed to confirm complete consumption of 
material. On day 20, fecal and serum samples are collected from each animal for analysis 
of anti-CTB antibodies. Mice are bled retro-orbitally and the samples stored at -20°C until 
assayed Fecal samples are collected and frozen overnight at -70°C, lyophilized 3 
resuspended in 0.8 ml PBS (pH7.2) containing 0.05% sodium azide per 15 fecal pellets, 

25 centrifuged at 1400xg for 5 min and the supernatant stored at -20°C until assayed. Samples 
are then serially diluted in PBS containing 0.05% Tween-20 (PBST) and assayed for anti- 
CTB IgG in serum and anti-CTB IgA in fecal pellets by the ELISA method, as described 
earlier. 

30 Assessment of diabetic symptoms in NOD mice: The incidence of diabetic symptoms is 
compared among mice fed with control nicotine free edible tobacco and those that express 
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the CTB-proinsulin fusion protein. Four week old female NOD mice are divided into two 
groups, each group consisting of ten mice. Each group is fed with control or transgenic 
edible tobacco (nicotine free) expressing the CTB-proinsulin fusion gene. The feeding 
dosage is determined based on the level of expression. Starting at 1 0 weeks of age, the mice 
are monitored on a biweekly basis with urinary glucose test strips (Clinistix and Diastix, 
Bayer) for development of diabetes. Glycosuric mice are bled from the tail vein to check for 
glycemia using a glucose analyzer (Accu-Check, Boehringer Mannheim). Diabetes is 
confirmed by hyperglycemia (>250 mg/dl) for two consecutive weeks (Ma et aL, 1997). 



WO 01/72959 



35 



— PCT/US01/06288" 



References to Project Description 

ArakawaT.YuJXhong ? DKX>H^^ Aplant-based 
cholera toxin B subunit-insulin fusion protein protects against the development of 
autoimmune diabetes. Nature Biotechnology. 16:934-938. 

Arakawa T. Chong DKX. Merritt JL. Langridge WHR (1997) Expression of cholera toxin 
B subunit oligomers in transgenic potato plants. Transgenic Research 6:403-413. 

Amtzen CL, Mason H.S., et al (1998) Edible vaccine protects mice against E. coli heat labile 
enterotoxin: potatoes expressing a synthetic LTB gene. Vaccine.l6(13): 1336-1343 

Bendich AJ (1987) Why do chloroplasts and mitochondria contain so many copies of their 
genome? Bioassays 6:279-282. 

Bergerot I, Ploix C, Peterson J, Moulin V, Rask C, Fabien N, et al. (1997) A cholera toxoid- 
insulin conjugate as an oral vaccine against spontaneous autoimmune diabetes. Proc. 
Natl. Acad. Sci. USA. 94:4610-4614. 

Bumette JP (1983) Experimental Manipulation of Gene Expression. Oxender, DI. 5 Fox, 
C.F., ets. Pp. 71-82. Alan R. Liss, Inc., New York, NY 

Brixey J, Guda C, Daniell H (1997) The chloroplast psbA promoter is more efficient in E. 
coli than the T7 promoter for hyper expression of a foreign protein. Biotechnology 
Letters 19:395-400. 

Carlson PS (1973) The use of protoplasts for genetic research. Proc. Natl. Acad. Sci. USA 
70:598-602 

Chance R.E. Frank BH ( 1 993) Research, development, production and safety of biosynthetic 
human insulin. Diabetes Care. 16(3):133-142. 



WO 01/72959 



36 



— PCT/US01/06288 



Cohen A. Mayfield (1997) Translational regulation of gene expression in plants. Current 
Opinion in Biotechnology 8: 189-194. 

Cowley DJ. MackinRB (1997) Expression, purification and characterization of recombinant 
human proinsulin. FEBS Letts. 402: 124-130. 

Crea R. Kraszewski A. Kirose T, Itakura K (1978) Chemical synthesis of genes for human 
insulin. Proc. Natl. Acad. Sci. 75(12): 5765-5769. 

Daniell EL (1993) Foreign gene expression in chloroplasts of higher plants mediated by 

rungsten particle bombardment Methods Enzymol 217:536-556. 
Daniell EL (1995) Producing polymers in plants and bacteria. Inform 6: 1365-1370. 

Daniell H. (1997) Transformation and foreign gene expression in plants mediated by 
microprojectile bombardment. Meth. Mol. Biol 62:453-488 

Daniell H. (1999) Universal chloroplast integration and expression vectors, transformed 
plants and products thereof, World Intellectual Property Organization WO 99: 105 13. 

Daniell H. Datta R. Varma S. Gray S. Lee SB (1998) Containment of herbicide resistance 
through genetic engineering of the chloroplast genome. Nature Biotechnology 
16:345-348. 

Daniell H. Guda C. (1997) Biopolymer production in microorganisms and plants. Chemistry 
and Industry, 14:555-560. 

Daniell H. Guda C. McPherson DT, XU J. Zhang X. Urry DW (1997) Hyper expression of 
an environmentally friendly synthetic polymer gene. Meth Mol Biol 63:359-371 . 



WO 01/72959 



37 



— PCTAJS01/06288 



Daniell H. Krishnan M. McFadden BA (1991) Expression of B-glucuronidase gene in 
different cellular compartments following biolistic delivery of foreign DNA into 
wheat leaves and calli. Plant Cell Reports 9:615-619. 

Daniell H. Krishnan M. Umabai U. Gnanam A (1986) An efficient and prolonged in vitro 
translational system from cucumber ecoplasts. Biochem. Biophys. Res. Comun 
135:48-255. 

Daniell H. McFadden BA (1987) Uptake and expression of bacteria and cyanobacterial 
genes by isolated cucumber etioplasts. Proc. Natl. Acad. ScL USA 84 49-6353. 

Daniell H. McFadden BA (1988) Genetic Engineering of plant chloroplasts. United States 
Patents 5,932,479; 5,693,507 

Daniell H. Ramanujan P. Krishnan M. Gnanam A. Rebeiz CA (1983) in vitro synthesis of 
photosynthetic membranes: I. Development of photosystem I activity and cyclic 
phosphorylation. Biochem. Biophys. Res, Comun. 111:740-749. 

Daniell H. Rebeiz CA (1982) Chloroplast culture IX: Chlorophyl(ide): A biosynthesis in 
vitro at rates higher than in vivo. Biochem. Biophys. Res, Comun 106:466-471 . 

Daniell H. Vivekananda J. Neilsen B. Ye GN: Tewari KK. Sanford JC (1990) Transient 
foreign gene expression in chloroplasts of cultured tobacco cells following biolistic 
delivery of chloroplast vectors. Proc. Natl. Acad. Sci USA 87:88-92. 

Davidson, MB (1998) Diagnosis and classification of diabetes mellitus in "Diabetes 
Mellitus-Diagnosis and Treatment. Pp. 1-16. 4 th edition., W.B. Saunders Co., 
Philadelphia, PA. 

Dertzbaugh MT, Elson CO (1993) Comparative effectiveness of the cholera toxin B subunit 
and alkaline phosphatase as carriers for oral vaccines. Infect. Immun. 61 : 48-55. 



WO 01/72959 



38 



PCT/USO 1/06288 



Drescher DF, Follmann H. Haberlein I (1998: Sulfitolysis and thioredoxin-dependent 
reduction revea] the presence of a structural disulfide bridge in spinach chloroplast 
fructose-1.6-bisphospate. FEBS Letters 424:109-112. 

During K. Hippe S. Kreuzaler F. Schell J (1990) Synthesis and self-assembly of a functional 
monoclonal antibody in transgenic Nicotiana tabacum. Plant Molecular Biology. 
15:281-293. 

Edwards K. Johnstone C. Thompson C (1991) A simple and rapid method for preparation 
of plant genomic DNA for PGR analysis. Nucleic Acids Res. 19:1349. 

Eibl C. Zou Z. Beck A. Kim M. Mullet J. Koop UH 9 1 999) In vivo analysis of plastid psbA, 
rbcL and rp!32 UTR elements by chloroplast transformation: tobacco plastid gene 
expression is controlled by modulation of transcript levels and translation efficiency. 
The Plant Journal 19: 333-345. 

Gill DM (1976) The arrangement os subunits in cholera toxin. Biochemistry 15: 1242-1248. 

Goeddel DV, Kleid DG, Bolivar F, Heyneker HL. Yansura DG, Crea R, Hirose T, 
Kraszewski A. Italkura K. Riggs AD (1979) Expression m Escherichia coli of 
chemically synthesized genes for human insulin. Proc. Natl. Acad. Sci. 76: 106-1 10. 

Goldberg AL, Goff SA (1986) Maximizing Gene Expression. Reznikoff and Gold, eds.pp. 
287 311, Butterworfh Publishers, Stoneham, MD. 

Guda C. Zhang X. McPherson DT, XU J. Cherry J. Uny DW, Daniell H. 91995) 
Hyperexpression of an environmentally friendly synthetic gene. Biotechnol Lett 
17:745-750. 

Guda C. Lee SB, Daniell H 92000) Stable expression of biodegradable protein based 
polymer in tobacco chloroplasts. Plant Cell Rep. 19:257-262. 



WO 01/72959 



39 



PCT/US01/06288 



Guriby P (1978) Bacteria directed to produce insulin in test application of genetic code. J. 
AM. Med. Assoc. 240(16): 1697-1698. 

Hall SS (1988) Invisible Frontiers - The Race to Synthesize a Human Gene. Atlantic 
Monthly Press, New York, NY. 

Hancock WW, Sayegh MH, Weiner HL et al. (1993) Oral, but not intravenous, alloantigen 
prevents accelerated, allograft rejection by selective intragraft Th2 cell cativation. 
Transplanation 55: 1 112-18. 

Haq T.A. Mason HS. Clements JD. Aintzen C. et al (1995 Oral immunization with a 
recombinant bacterial antigen produced in transgenic plants. Science 268:714-716 

Herzog RW Singh NK Urry DW Daniell H (1997) Synthesis of a protein based polymer 
(Elastomer) gene in Aspergillus nidulans. Applied Microbiology & Biotechnology 
47:368-372. 

Itakura K. Hirose T. Crea R. Riggs A.D. Heyneker HL Bolivar F. Boyer HW (1977) 
Expression in Escherichia coli of a chemically synthesized gene for the hormone 
somatostatin. Science. 198:1056-1063. 

Ido Y. Vindigni A. Chang K. Stramm L. Chance R. Heath WF, DiMarchi RD, DiCera E. 
Williamson JR (1997)Prevention of vascular and neural dysfunction in diabetic rats 
by C-peptide. Science 277:563-566. 

Jones JN (1990) Production of human calcitonin by recombinant DNA technology. 
"Fundamentals of Protein Technology" (Stein, S. ed), pp. 171-180, Xoma Corp., 
Berkely, CA. 



WO 01/72959 



40 



"PCT/US01/06288 



Khoury SJ Lider O. Weiner HL et al. (1990) Suppression of experimental auto immune 
encephalomyelitis by oral administration of myelin basic protein. Cell Immunol. 
131:302-10. 

Kim J-S, Raines RT (1993) Ribonuclease S-peptide as a carrier in fusion proteins. Protein. 
Sci. 2:348-356. 

Kota M. Daniell H. Varma S. Garczynski F. Gould F. Moar WJ (1999) Overexpression of 
the Bacillus thuringiensis Cry2A protein in chloroplasts confers resistance to plants 
against susceptible and Bt-resistant insects. Proc. Natl. Acad. Sci. USA 96:1840- 
1845. 

Laemmli UK (1970) Cleavage of structural proteins during the assembly of the head of 
bacteriophage T4. Nature 227:680-685. 

Lebens M. Holmgren J. (1994) Mucosal vaccines based on the use of Cholera Toxin B 
subunit as immunogen and antigen carrier. Recombinant Vectors in Vaccine 
Development [Brown F (ed)], 82: 215-227. 

Lee C Levin A. Branton D. (1987) Copper staining: A five-minute protein stain for sodium 
dodecyl sulfate-polyacrylamide gels. Anal Biochem 166:308-312. 

McBride KE Schaaf DJ. Daley M. Stalker DM (1994) Controlled expression of plastid 
transgenes in plants based on a nuclear encoded and plastid targeted T7 RNA 
polymerase. Proc. Natl. Acad. Sci. USA 91:7301-7305 

McBride KE Svab Z, Schaaf DJ, Hogen PS, Stalker DM, MaligaP 91995) Amplification of 
a chimeric Bacillus gene in chloroplasts leads to extraordinary level of an insecticidal 
protein in tobacco. Bio/technology 13:362-365. 



~WO 01/72959 



41 



~PCTAJS01/06288 



McKenzie SJ and Halsey JF (1984) Cholera toxin B subunit as a carrier protein to stimulate 
a mucosal immune response. Journal of Immunology. 133:1818-24. 

McPherson DT, Morrow C. Mineham DJ, Wu J. Hunter E. Urry DW 9 1992) Production and 
purification of a recombinant elastomeric polypeptide. G (IVPGVG) 19-VPGV from . 
Escherichia coli. Biotechnology Prog. 8:317-322. 

McPherson DT Xu J Urry DW (1996) Product purification by reversible phase transition 
following Escherichia coli expression of genes encoding up to 251 repeats of the 
elastomeric pentapeptide GVGVP. Protein Expression and Purification 7:51-57. 

Ma S-W, Zhao D-L. Yin Z-Q. Mukherjee R. Singa B. Qin H-Y et al. (1997) Transgenic 
plants expressing autoantigens fed to mice to induce oral tolerance. Transgenic Res. 
3:793-796. 

Marina CV et al. (1988) An Escherichia coli vector to express and purify foreign proteins 
by fusion to and separation from maltose binding protein. Gene 74:365-373. 

Mason HS Ball JM Anrtzen CJ et al. (1996), Expression of Norwalk virus capsid protein in 
transgenic tobacco and potato and its oral immunogenicity in mice. Proc. Nat. Acad. 
Sci. USA 93:5335-40. 

Mathiowitz E. Jacob JS Jong YS Carino GP. Chickering DE. Chaturvedi P. Santos CA 
VijayarahauauK. Montgomery S. BassettM. Morrell C. (1997) Biologically erodible 
microspheres as potential oral drug delivery systems. Nature 386:410-414. 

May GD. Mason HS Lyons PC (1996) Application of transgenic plants as production 
systems for pharmaceuticals in ACS symposium series 647 . Fuller et al eds. Chapter 
13. 196-204. 



WO 01/72959 



42 



PCT/US01/06288 



Mekalanos JJ Sadoff FC ( 1 979) Cholera vaccines: Fighting an ancient scourge. Science 265 : 
1387-1389 

Meyer DE Chilkoti A ( 1 999) Purification of recombinant proteins by fusion with thermally- 
responsive polypeptides. Nature Biotechnology 17: 1 1 12-1 115. 

Miller A. Weiner HL et aL (1 992) Suppressor T cells generated by oral tolerization to myelin 
basic protein suppress both in vitro and in vivo immune responses by the release of 
transforming growth factor B after antigen specific triggering. Proc Nat. Acad. ScL 
USA 89:421-5. 

Mor TS, Palmer KE et aL (1998) Perspective: edible vaccines - a concept coming of age. 
Trends In microbiology 6:449-453 

Nilsson J, Stahl S. Lundeberg J. Uhlen M. Nygren PA (1997) Affinity fusion strategies for 
detection, purification, and immobilization of recombinant proteins. Protein Expr. 
Purif. 11:U16. 

Oakly WG, Pyke DA, Taylor KW 9 1973) Biochemical basis of Diabetes. In "Diabetes and 
It's Management", pp. 1-14, 2 nd edition, Blackwell Scientific Publications. Osney 
Mead 5 Oxford. 

Ong E et aL (1989) The cellulose-binding domains of cellulases: tools for biotechnology. 
Trends BiotechnoL 7:239-243. 

Reulland E. Miginiac-Maslow M (1999) Regulation of chloroplast enzyme activities by 
thioredoxins; activation or relief from inhibition. Trends in Plant science 4:136-141. 

Richter L. Mason HS 3 Arntzen CJ (1996) Transgenic plants created for oral immunization 
against diarrheal diseases. Journal of travel medicine. 3:52-56. 



WO 01/72959 



43 



PCT7US01/06288 



Rogers SO, Bendich AJ (1988) In: Gelvin SB Schilperoot RA (eds) Plant molecular biology 
manual, Kluwer Academis Publishers, Dordrecht, Netherlands, pp. A6:l-10. 

Roy H (1989) Rubisco assembly; amodel system for studying the mechanism of chaperonin 
action. Plant Cell. 1:1035-1042 

Sambrook J. Fritch EF, Maniatis T (1989) Molecular cloning. Cold Spring Harbor Press, 
cold Spring Harbor, New York. 

Sayegh MH, Khoury SJ 5 Weiner HL et al (1992) Induction of immunity and oral tolerance 
with polymorphic class II major histocompatibility complex allopeptides in the rat. 
Proc. Nat. Acad. Sci. USA : 89:7762-6 

Schmidt M. Babu KR Khanna N. Marten S. Rimas U. (1999) Temperature induced 
production of recombinant human insulin in high density cultures of recombinant 
E.coli. Biotechnology 68: 71-83. 

Schonberger O. Hirst T. R and Pines O (1991) Targeting and assembly of an oligomeric 
bacterial enterotoxcid in the endoplasmic reticulum of Sacharomyces cervisiae. 
Molecular Microbiology 5:2663-2671. 

Smith DB, Johnson KS (1988) Single-step purification of polypeptides expressed in 
Escherichia coli as fusion with glutathione S-transferase, Gene 67:31-40. 

Smith PA et al. (1998) A plasmid expression system for quantitative in vivo biotinylation of 
thioredoxin fusion proteins in Escherichia coli. Nucleic Acids Res. 26:1414-1420. 

Smith MC Furman TC, Ingolia TD, Pidgeon C. (1988) Chelating peptide-immobilized metal 
ion affinity chromatography. J. Biol. Chem. 263:7211-7215. 



WO 01/72959 



44 



PCT/US01/06288 



Sidorov VA, Kasten D, Pang SZ, Hajcukiewicz PTJ, Staub JM, Nehra NS (1999) Stable 
chloroplast transformation in potato use of green fluorescent protein as a plastid 
marker. Plant Journal 19:209-216. 

Steiner DF, ArquUa ER, Lerner J, Martin DB ( 1 978) Recombinant DNA Research, Diabetes. 
27:877-878 

Su X, Prestwood AK, McGraw RA (1992) Production of recombinant percine tumor 
necrosis factor alpha in a novel E, coli expression system. Biotechniques 13 :756-62, 

Sun SB, Holmgren J, Czerkinsky C (1994) Cholera toxin B subunit: an efficient 
transmucosal carrier-delivery system for induction of peripheral immunological 
tolerance. Proc. Natl. Acad. Sci. USA. 9:10795-10799. 

Sun JB, Rask C. Olsson T, Holmgren J, Czerkinsky C (1996) Treatment of experimental 
autoimmune encephalomyelitis by feeding myelin basic protein conjugated to cholera 
toxin B subunit Proc. Natl. Acad. Sci. US 93:7196-9201. 



Svab Z. Maliga P ( 1 993) High frequency plastid transformation in tobacco by selection for 
a chimeric aadA gene. Proc. Natl. Acad. Sci. USA 90:913-917. 

Tsao KW, deBarbieri B. Hanspeter M. Waugh DW (1996) A versatile plasmid expression 
vector of the production of biotinylated proteins by site-specific enzymatic 
modification in Escherichia coli. Gene 69:59-64. 

Thanavala, y, Yang Y, Lyons P. et al (1995) Immunogenicity of transgenic plant derived 
hepatitis B surface antigen, Proc. Nat. Acad. Sci. USA 92:3358-3361. 

Trentham DR, Weiner HL et al (1993) Effects of oral administration of Type II collagen on 
rheumatoid arthritis. Science 261:1727-30. 



WO 01/72959 



45 



PCT/US01/06288 



Urry DW (1995) Elastic biomolecular machines. Scientific American. 272: 64-69. 

Urry DW (1991) Thermally Driven Self-assembly. Molecular Structuring and Entropic 
Mechanisms inElastomeric Polypeptides in cc Molecular Conformation and Biological 
Interactions" (Balaram, P., and Ramasashan S., Eds.) Pp. 555-583. Indian Acad. Of 
ScL, Bangalore, India. 

Urry DW, McPherson J., Xu J. ? Gowda DC, Jing N. Parker TM, Daniell H. Guca C. (1996) 
Protein Based Polymeric Materials (Synthesis and Properties) in ftC Polymeric Materials 
Encyclopedia". (Solomone ed.) Vol. 9. Pp 2645-2699, CRC Press. 



—WO 01/72959 



46 



PCT/US01/06288 



1052-P-00 

EXPRESSION OF THE NATIVE CHOLERA 
TOXIN B SUBINET GENE AS OLIGOMERS IN 
TRASNGENIC TOBACCO CHLOROPLASTS 



WO 01/72959 



47 



PCT/US01/06288- 



EXPRESSIONS THE NATIVE CHOLERA TOXIN B SUBUNIT GENE 
AS OLIGOMERS IN TRANSGENIC TOBACCO CHLOROPLASTS 

5 FIELD OF THE INVENTION 

This invention relates to expression of native cholera toxin B subunit gene as' 
oligomers in transgenic plant chloroplasts, particularly, in transgenic tobacco chloroplasts. 

BACKGROUND 

10 Pharmacologically important proteins are increasingly being expressed in plants as 

an economical alternative to conventional protein production methods. Transgenic plants 
expressing recombinant proteins and biologically active peptides such as vaccines, growth 
factors, hormones, monoclonal antibodies and enzymes have been reported (1). 

Proteins from different sources of a wide range have been expressed in nuclear 

15 transgenic plants. Protein accumulation levels of recombinant enzymes, like phytase and 
xylanase were high in nuclear transgenic plants (14% and 4. 1% of total soluble tobacco leaf 
protein respectively). This may be because their enzymatic nature made them more resistant 
to proteolytic degradation. Most nuclear transgenic plants express low levels of recombinant 
protein of human, viral or bacterial origin. Human proteins arc expressed at levels ranging 

20 from as low as 0.000017% of fresh weight (human 0 interferon expressed in tobacco) to a 
high of 0. 1% of soluble seed protein (human enkephalin expressed in arabidopsis seeds, 2). 
The Norwalk virus capsid protein expressed in potatoes caused oral immunization when 
consumed, although food expression levels were low, maximizing at 0.3% of total soluble 
protein (3). 

25 

SUMMARY OF THE INVENTION 

This invention includes expression of native cholera toxin B subunit gene as 
oligomers in transgenic tobacco chloroplasts which may be utilized in connection with large- 
scale production of purified CTB, as well as an edible vaccine if expressed in an edible plant 
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or as a transmucosal carrier of peptides to which it is fused to either enhance mucosal 
immunity or to induce oral tolerance of the products of these peptides. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 Fig. 1. pLD-LH-CTB vector and PCR analysis of control and chloroplast 

transformants. A. The perpendicular dotted line shows the vector sequences that are 
homologous to native chloroplast DNA, resulting in homologous recombination and site 
specific integration of the gene cassette into the chloroplast genome. Primer landing sites 
are also shown. B. PCR analysis: 0.8% agarose gel of PCR products using total plant DNA 

10 as template. 1 kb ladder (lane 1); Untransformed plant (lane 2); PCR products with DNA 
template from transgenic lines 1-10 (lanes 3 -12). 

Fig. 2. Western blot analysis of CTB expression in E.coli and chloroplasts. 

Blots were detected using rabbit anti-cholera serum as primary antibody and alkaline 
phosphatase labeled mouse anti-rabbit IgG as secondary antibody. A. E.coli protein analysis: 

15 Purified bacterial CTB, boiled (lane 1); Unboiled 24 h and 48 h transformed (lanes 2 & 4) 
and untransformed (lanes 3 & 5) E. coli cell extracts. Plant protein analysis: B. Color 
Development detection: Boiled, untransformed protein (lane 1); Boiled, purified CTB 
antigen (lane 2): Boiled, protein from 4 different transgenic lines (lanes 3 - 6). C. 
Chemiluminescent detection: Plant protein- Untransformed, unboiled (lane 1); 

20 Untransformed, boiled (lane 2); Transgenic lines 3 & 7, boiled (lanes 3 & 5), Transgenic line 
3, unboiled (lane 4); Purified CTB antigen boiled (lane 6), unboiled (lane 7); Marker (lane 
8). 

Fig. 3. Southern blot analysis of T 0 and T! plants. A. Untransformed and 

transformed chloroplast genome: Transformed and untransformed plant DNA was digested 
25 with Bglll and hybridized with the 0.81 kb probe that contained the chloroplast flanking 
sequences used for homologous recombination. Southern Blot results of To lines (B) 
Untransformed plant DNA (lane 1); Transformed lines DNA (lanes 2-4) and T t lines (C) 
Transformed plant DNA (lanes 1-4) and Untransformed plant DNA (lane 5). 

Fig. 4. A. Plant phenotypes; 1 : Confirmed transgenic line 7; 2: Untransformed 
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plant B. 10-day-old seedlings of T, transformed (1, 2 & 3) and untransforrned plant (4) 
plated on 500mg/L spectinomycin selection medium. 

Fig. 5. A. CTB ELISA quantification: Absorbance of CTB-antibody complex 

in known concentrations of total soluble plant protein was compared to absorbance of known 
5 concentration of bacterial CTB-antibody complex and the amount of CTB was expressed as 
a percentage of the total soluble plant protein. Total soluble plant protein from young, 
mature and old leaves of transgenic lines 3 and 7 was quantified. B. CTBGM iGanglioside 
binding ELISA assays: Plates coated first with GMj gangliosides and BSA respectively, 
were plated with total soluble plant protein from lines 3 and 7, untransforrned plant total 
1 0 soluble protein and purified bacterial CTB and the absorbance of the GMiganglioside-CTB- 
antibody complex in each case was measured. 

DETAILED DESCRIPTION 

Bacterial antigens like the B subunit proteins, CTB and LTB, which are two 
15 chemically, structurally and immunologically similar candidate vaccine antigens of 
prokaryotic enterotoxins, have been expressed in plants. CTB is a candidate oral subunit 
vaccine for cholera that causes acute watery diarrhoea by colonizing the small intestine and 
producing the enterotoxin, cholera toxin (CT). Cholera toxin is a hexameric AB 5 protein 
consisting of one toxic 27 kDa A subunit having ADP ribosyl transferase activity and a 
20 nontoxic pentamer of 1 1 .6 kDa B subunits (CTB) that binds to the A subunit and facilitates 
its entry into the intestinal epithelial cells. CTB when administered orally is a potent 
mucosal immunogen, which can neutralize the toxicity of the CT holotoxin by preventing 
it from binding to the intestinal cells (4). This is believed to be a result of it binding to 
eukaryotic cell surfaces via G MI gangliosides, receptors present on the intestinal epithelial 
25 surface, eliciting a mucosal immune response to pathogens and enhancing the immune 
response when chemically coupled to other antigens (5,6). 

Native CTB and LTB genes have been expressed at low levels via the plant nucleus. 
Since, both CTB and LTB are AT-rich compared to plant nuclear genes, low expression was 
probably due to a number of factors such as aberrant mRNA splicing, mRNA instability or 
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. inefficient codon usage. To avoid these undesirable features synthetic "plant optimized" 
genes encoding LTB were created and expressed in potato, resulting in potato tubers 
expressing up to 10 - 20 jug of LTB per gram fresh weight (7). However, extensive codon 
modification of genes is laborious, expensive and often not available due to patent 
5 restrictions. One of the consequences of these constitutively expressed high LTB levels, was 
the stunted growth of transgenic plants that was eventually overcome by tissue specific 
expression in potato tubers. The maximum amount of CTB protein detected in auxin 
induced, nuclear transgenic potato leaf tissues was approximately 0.3% of the total soluble 
leaf protein when the native CTB gene was fused to an endoplasmic reticulum retention 
10 signal, thus targeting the protein to the endoplasmic reticulum for accumulation and 
assembly (8). 

Increased expression levels of several proteins have been attained by expressing 
foreign proteins in chloroplasts of higher plants (9 - 11). Human somatotropin has been 
expressed in chloroplasts with yields of 7% of the total soluble protein (12). The 

15 accumulation levels of the Bt Cry2Aa2 operon in tobacco chloroplasts are as high as 46.1 
% of the total soluble plant protein (1 3). This high level of expression is attributed to the 
putative chaperoning, orf 1 and orf 2, upstream of Cry2Aa2 in the operon that may help to 
fold the protein into a crystalline form that is stable and resistant to proteolytic degradation. 
Besides the ability to express polycistrons, yet another advantage of chloroplast 

20 transformation I, is the lack of recombinant protein expression in pollen of chloroplast 
transgenic plants. As there is no chloroplast DNA in pollen of most crops, pollen mediated 
outcross of recombinant genes into the environment is minimized (10 - 15), 

Since the transcriptional and translational machinery of plastids is prokaryotic in 
origin and the N. tabaccum chloroplast genome has 62.2% AT content, it was likely that 

25 native CTB genes would be efficiently expressed in this organelle without the need for codon 
modification. Also, codon comparison of the CTB gene with psbA, the major translation 
product of the chloroplast, showed 47% homology with the most frequent codons of the 
psbA gene. Highly expressed plastid genes display a codon adaptation, which is defined as 
a bias towards a set of codons which are complimentary to abundant tRNAs (16). Codon 
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analysis showed that 34% of the codons of CTB are complimentary to the tRNA population 
in the chloroplasts in comparison with 5 1 % of psbA codons that are complimentary to the 
chloroplast tRNA population. 

Also, stable incorporation of the CTB gene into the precise location between the trnA 
5 and trnl genes of the chloroplast genome by homologous recombination, should eliminate 
the 'position effect' frequently observed in nuclear transgenic plants. This should allow 
uniform expression levels in different transgenic lines. Amplification of the 
transgene, should result in a high level of CTB gene expression since each plant cell contains 
up to 50,000 copies of the plastid genome (17). Another significant advantage of the 
10 production of CTB in chloroplasts, is the ability of chloroplasts to form disulfide bridges 
(12,18,19) which are necessary for the correct folding and assembly of the CTB pentamer 
(20). 

In this study, we report the integration of the CTB gene into the inverted repeat region 
of the tobacco chloroplast genome, allowing 2 copies / chloroplast genome of the CTB gene 
1 5 per cell, resulting in chloroplasts accumulating high levels of CTB . This eliminates the need 
to modify the CTB gene for optimal expression in plants. 

Construction of the Chloroplast Expression Vector pLD-CTB: The leader sequence (63 
bp) of the native CTB gene was deleted and a start codon was introduced at the 5* end. 
Primers were designed to introduce an rbs site 5 bases upstream of the start codon. The CTB 

20 PCRproduct was then cloned into the multiple cloning site of the pCR2. 1 vector (Invitrogen) 
and subsequently into the chloroplast expression vector pLD-CtV2 using suitable restriction 
sites. Restriction enzyme digestions of the pLD-LH- CTB vector were done to confirm the 
correct orientation of the inserted fragment- 
Expression of the pLD-LH- CTB vector was tested in E. coli XL- 1 Blue MRF TC strain 

25 before tobacco transformation. E, coli was transformed by standard CaCl 2 transformation 
procedures. Transformed E. coli (24 and 48 hrs culture in 100ml TB with 100 jug/wl 
ampicillin) and untransformed E. coli (24 and 48 hrs culture in 100 ml TB with 12.5 Mg/ml 
tetracycline) were centrifuged for 15 min. The pellet obtained was washed with 200mM 
Tris-CI twice, resuspended in 500 /A extraction buffer (200mM Tris-Cl, pH 
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8.0, lOOmM NaCl, lOmM EDTA, 2mM PMSF) and sonicated. To aliquots of 100 jul 
transformed and untransformed sonicates [containing 50- 100 jug of crude protein extract 
as determined by Bradford protein assay (Bio-rad)] and purified CTB (100 ng, Sigma), 2X 
SDS sample buffer was added. These sample mixtures were loaded on a 1 5% sodium SDS 
5 -PAGE gel and electrophoresed at 200v for 45 min. in Tris-glycine buffer (25mM Tris, 250 
mM glycine, pH 8.3, 0.1% SDS). The separated protein was transferred to a nitrocellulose 
membrane by electroblotting at 70v for 90 min. 

Immunoblot Analysis of CTB Production in E. coli: Nonspecific antibody reactions were 
blocked by incubation of the membrane in 25 ml of 5% non-fat dry milk in TBS buffer for 

10 2 h on a rotary shaker (40 rpm) followed by washing in TBS buffer for 5 min. The 
membrane was incubated for lh in 30 ml of a 1 :5000 dilution of rabbit anti-cholera antiserum 
(Sigma) in TBST (TBS with 0.05% Tween-20), containing 1% non-fat dry milk, followed 
by washing thrice in TBST. Incubation for an hour at room temperature in 30 ml of a 
1:10,000 dilution of alkaline phoshphatase conjugated mouse anti-rabbit IgG. (Sigma) in 

15 TBST, washing thrice in TBST and once with TBS was followed by incubation in the 
Alkaline Phoshphatase Color Development Reagents, BCIP/NBT in AP color development 
buffer (Bio-Rad) for an hour. 

Bombardment and Regeneration of Chloroplast Transgenic Plants: Fully expanded, dark 
green leaves of about two-month old Nicotiana tabacum var. Petit havana plants were 

20 placed abaxial side up on filter papers in RMOP (21) petridish plates. Microprojectiles 
coated with pLD-LH-CTB DNA were bombarded into the leaves using the biolistic device 
PDSIOOO/He (Bio-Rad), as described by Daniell (21). Following incubation at 24 °C in the 
dark for two days, the bombarded leaves were cut into small (-5mm 2 ) pieces and placed 
abaxial side up (5 pieces/plate) on selection medium (RMOP containing 500 mg/L 

25 spectinomycin dihydrochloride). Spectinomycin resistant shoots obtained after about 1-2 
months were cut into small pieces (-2mm 2 ) and placed on the same selection medium. 
PCR Analysis: Total plant DNA from putative transgenic and untransformed plants was 
isolated using the DNeasy kit (Qiagen). PCR primers 3P (S'AAAACCCGTCCTCAGT 
TCGG ATTGC-3 ! ) and 3M (5 f -CCGCGTTGTTTCATCAAGCCTTACG-30 were used for 
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PCR on putative transgenic and untransformed plant total DNA. Samples were carried 
through 30 cycles using the following temperature sequence: 94°C for 1 min, 62°C for 
1.5minand72°Cfor2min. Cycles were preceded by denaturation for 5 min at 94° C. PCR 
confirmed shoots from the second selection were transferred to rooting medium (MSO 
5 medium containing 500 mg/L spectinomycin). 

Southern Blot Analysis: Ten micrograms of total plant DNA (isolated using DNeasy kit) 
per sample were digested with Bglll, separated on a 0.7% agarose gel and transferred to a 
nylon membrane. A 0. 8 kb fragment probe, homologous to the chloroplast border sequences, 
was generated when vector DNA was digested with Bglll and BamHI. Hybridization was 
10 performed using the Ready To Go protocol (Pharmacia). Southern blot confirmed plants 
were transferred to pots. On flowering, seeds obtained from T 0 lines were germinated on 
spectinomycin dihydrochloride-MSO media and Tj seedlings were grown in bottles 
containing MSO with spectinomycin (500 mg/L) for 2 weeks. The plants were later 
transferred to pots. 

15 Western Blot Analysis of Plant Protein: Transformed and untransformed leaves (100 mg) 
were ground in liquid nitrogen and resuspended in 500 /A of extraction buffer (200mM Tris- 
Cl, pH8.0, 100 mM NaCl, lOmM EDTA, 2 mM PMSF). Leaf extracts (100 - 120 jug as 
determined by Lowry assay) were boiled (4 min) and unboiled in reducing sample buffer 
(BioRad) and electrophoresed in 12% polyacrylamide gels using the buffer system of 

20 Laemmli (22). The separated proteins were transferred to a nitrocellulose membrane by 
electroblotting at 85v for lh. The immunoblot detection procedure was similar to that done 
for E. coli blots described above. For the chemiluminescent detection, the S. Tag™ AP 
Lumiblot kit (Novagen) was used. 

ELISA Quantification of CTB: Different concentrations (100 jul/well) of 100 mg leaves 
25 (transformed and untransformed plants) ground with liquid nitrogen and resuspended in 
bicarbonate buffer, pH 9.6 (15mM N^CC^, 35mM NaHC0 3 ) were bound to a 96 well 
polyvinyl chloride microliter plate (Costar) overnight at 4 ° C . The background was blocked 
with 1% Bovine serum albumin (BSA) in 0.01M phosphate buffered saline (PBS) for 2h at 
37 ° C 3 washed thrice with washing buffer, PBST (PBS and 0.05% Tween 20) and rabbit anti- 
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cholera serum diluted 1:8,000 in PBST containing 0.5% BSA was added and incubated for 
2h at 37°C. The wells were washed and incubated with 1:50,000 mouse anti rabbit IgG- 
alkaline phosphatase conjugate in PBST containing 0.5% BSA for 2h at 37°C. The plate 
was developed with Sigma Fast pNPP substrate (Sigma) for 30 minutes at room temperature 

5 and the reaction was ended by addition of 3N NaOH and plates were read at 405 nm. 

GM t Ganglioside Binding Assay: To determine the affinity of chloroplast derived CTB for 
GM r gangliosides, microliter plates were coated with monosialoganglioside-GMx (Sigma) 
(3.0 yug/ml i* bicarb, buffer) and incubated at 4°C overnight. As a control, BSA 
(3.0 yug/ml in bicarb, buffer) was coated on some wells. The wells were blocked with 1% 

10 BSA in PBS for 2h at 37°C, washed thrice with washing buffer, PBST and incubated with 
dilutions of transformed plant protein, untransformed plant protein and bacterial CTB in 
PBS. Incubation of plates with primary and secondary antibody dilutions and detection was 
similar to the CTB ELISA procedure described above. 

pLD-LH-CTB vector construction and E. coli expression: The pLD-LH-CTB vector 
15 integrates the genes of interest into the inverted repeat regions of the chloroplast genome 
between the trnl and trnA genes. Integration occurs through homologous recombination 
events between the trnl and trnA chloroplast border sequences of the vector and the 
corresponding homologous sequences of the chloroplast genome as shown in Fig. 1A. The 
chimeric aminoglycoside 3' adenyltransferase (aadA) gene that confers resistance to 
20 spectinomycin-streptomycin and the CTB gene downstream of it are driven by the 
constitutive promoter of the rRNA operon (Prrn) and transcription is terminated by the 
psbA3' untranslated region. Since the protein synthetic machinery of chloroplasts is similar 
to that of E. coli (23), CTB expression of the pLD-LH-CTB vector in E. coli was tested. 
Western blot analysis of sonicated E. coli whole cell extract showed the presence of 1 1 kDa 
25 CTB monomers, similar to that obtained when purified commercially available CTB was 
treated in the same manner as shown in Fig. 2A Oligomeric expression of CTB was not 
observed in E. coli, as expected, due to the absence of a leader peptide sequence present in 
the native CTB gene that directs the CTB monomer into the periplasmic space allowing for 
concentration and oligomeric assembly. 
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Selection and Regeneration of Transgenic Plants: Bombarded leaf pieces when placed 
on selection medium continued to grow but were bleached. Green shoots emerged from the 
part of the leaf in contact with the medium. Five rounds of bombardment (5 leaves each) 
resulted in 68 independent transformation events. Each such transgenic line was subjected 
to a second round of antibiotic selection. These putative transformants were subjected to 
PGR analysis to distinguish from nuclear transformants and mutants. 
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Determination of Chloroplast Integration and Homoplasmy: PGR and Southern 
hybridization were used to determine integration of the CTB gene into the chloroplast 
genome. Primers, 3P and 3M, designed to confirm incorporation of the gene cassette into 
the chloroplast genome were used to screen putative transgenics initially. The primer, 3P, 
landed on tlie chloroplast genome outside of the chloroplast flanking sequence used for 
homologous recombination as shown in Fig. 1 A. The primer, 3M, landed on the aadA gene. 
No PCR product should be obtained if foreign genes are integrated into the nuclear genome 
or in mutants lacking the aadA gene. The presence of the 1.6kbPCRproductin9 ofthe 10 
putative transgenics screened, confirmed the site-specific integration ofthe gene cassette into 
the chloroplast genome. Database searches showed that no random priming took place as 
both the 3P and 3M primers showed no homology with other gene sequences. This is 
confirmed by the absence of PCR product in untransformed plants (Fig. IB). Similar 
strategy has been used successfully by us in order to confirm chloroplast integration of 
foreign genes (13,14,24,25). This screening is essential to eliminate mutants and nuclear 
transformants and saves space and labor of maintaining hundreds of transgenic lines. 

Southern blot analysis of three of the PCR positive transgenic lines was done to 
further confirm site specific integration and to establish copy number. In the chloroplast 
genome, Bgin sites flank the chloroplast border sequences 5 1 of 16SrRNA and 3 f ofthe trnA 
region as shown in Fig. 3A. A 6.17kb fragment from a transformed plant and a 4.47 kb 
fragment from an untransformed plant were obtained when total plant DNA from 
transformed and untransformed plants was digested with BgllL The blot of the digested 
products was probed with a 32 P random primer-labeled 0.81 kb trnl-trnA fragment. The 
probe hybridized with the control giving a 4.47 kb fragment as expected, while for the 
transgenic lines a 6 . 1 7 kb fragment was observed, indicating that all plastid genomes had the 
gene cassette inserted between the tml and trnA regions. The absence of a 4.47 kb fragment 
in transgenic lines indicates that homoplasmy has been achieved, to the detection level of 
a Southern blot. These results explain the high levels of CTB observed in transgenic tobacco 
plants. Southern blot confirmed plants transferred to pots were seen to have no adverse 
pleiotropic effects when compared to untransformed plants as shown in Fig.4A. Southern 
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blot analysis of Tj plants in Fig. 3C shows that all 4 transgenic lines analyzed maintained 
homoplasmy. . 

Immunoblot Analysis of Chloroplast Synthesized CTB: Anti-cholera toxin antibodies did 
not show significant cross-reaction with tobacco plant protein as can be seen in Fig. 2 C, 
lanes 1 & 2. Boiled and unboiled leaf hornogenates were run on 12% SDS PAGE gels. 
Unboiled chloroplast synthesized CTB protein appeared as compact 45 kDa oligomers as 
shown in Fig. 2C, lane 4 similar to the unboiled, pentameric bacterial CTB which appeared 
to have partially dissociated into tetramers, trimers and monomers upon storage at 4 °C over 
a period of several months from Fig. 2C, lane7. 
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While heat treatment (4 min. boiling) prior to SDS PAGE of pentameric bacterial 
CTB, gave CTB monomers predominantly, with some protein in the dimeric and trimeric 
form as shown in Fig. 2C, lane 6, chloroplast synthesized CTB dissociated into dimers and 
trimers only, when subjected to similar heat treatment as in Fig. 2C, lanes 3 & 5. These 
5 results are different from the heat induced dissociation of potato plant nucleus synthesized 
CTB; oligomers into monomers (8). A probable reason for this stability could be a more 
stable conformation of chloroplast synthesized CTB which maybe an added advantage in 
storage and administration of edible vaccines. Leaf homogetiates from four different 
transgenic plants showed almost similar expression levels of CTB protein (see Fig. 2B). 
10 This suggests very little clonal variation of CTB expression, as was confirmed later by 
ELISA quantification assays. Consistent expression levels of recombinant proteins in plants 
(as obtained for CTB in this research) may be essential for production of edible vaccines in 
plants. 

ELISA Quantification of CTB Expression: Comparison of the absorbance at 405nm of a 
15 known amount of bacterial CTB - antibody complex (linear standard curve) and that of a 
known concentration of transformed plant total soluble protein was used to estimate CTB 
expression levels. Optimal dilutions of total soluble protein from two transgenic lines were 
loaded in wells of the microliter plate. As reported previously (8), it was necessary to 
optimize the dilutions of total soluble protein, as levels of CTB protein detected varied with 
20 the concentration of total soluble protein, resulting in too high concentrations of total soluble 
protein inhibiting the CTB protein from binding to the wells of the plate. Both T 0 lines 
yielded CTB protein levels ranging between 3.5% to 4. 1 % of the total soluble protein (40 
^g of chloroplast synthesized CTB protein in 1 mg of total soluble protein) as shown in Fig. 
5 A. Also, estimation of CTB protein expression levels from different stages of leaves - 
25 young, mature and old determined that mature leaves have the highest levels of CTB protein 
expression. This is in accordance with the results obtained when similar experiments were 
performed when the Bt Cry2aA2 gene was expressed without the putative chaperonin genes, 
but contrary to results with the Bt Cry2aA2 operon, which showed high expression levels 
in older leaves, probably due to the stable crystalline structure (13). 
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GMj Ganglioside ELI S A Binding Assays : Both chloroplast synthesized and bacterial CTB 
demonstrated a strong affinity for GM1, - gangliosides (see Fig. 5B) indicating that 
chloroplast synthesized CTB conserved the antigenic sites necessary for binding of the CTB 
pentamer to the pentasaccharide GMJ. The GM^ binding ability also suggests proper folding 
5 of CTB molecules resulting in the pentameric structure. Since oxidation of cysteine residues 
in the B subunits is a prerequisite for in vivo formation of CTB pentamers (20), proper 
folding is a further confirmation of the ability of chloroplasts to form disulfide bonds. 

High levels of expression of CTB in transgenic tobacco did not affect growth rates, 
flowering or seed setting as has been observed in this laboratory, unlike previously reported 
10 for the synthetic LTB gene, constitutively expressed via the nuclear genome (7). 
Transformed plant seedlings were green in color while untransformed seedlings lacking the 
aadA gene were bleached white as shown in Fig. 4B when germinated on antibiotic medium. 

The potential use of this technology is three-fold. While, it can be used for large scale 
1 5 production of purified CTB, it can also be used as an edible vaccine if expressed in an edible 
plant or as a transmucosal carrier of peptides to which it is fused to, so as to either enhance 
mucosal immunity or to induce oral tolerance to the products of these peptides (5). Large- 
scale production of purified CTB in bacteria involves the use of expensive fermentation 
techniques and stringent purification protocols (26) making this a prohibitively expensive 
20 technology for developing countries. The cost of producing 1kg of recombinant protein in 
transgenic crops has been estimated to be 50 times lower than the cost of producing the same 
amount by E. coli fermentation, assuming that recombinant protein is 20% of total E.coli 
protein (27). Thus, isolation and lysis of CTB producing chloroplasts from chloroplast 
transformed plants could serve as a cost-effective means of mass production of purified CTB . 
25 If used as an edible vaccine, a selection scheme eliminating the use of antibiotic resistant 
genes should be developed. One such scheme uses the betaine aldehyde dehydogenase 
(BADH) gene, which converts toxic betaine aldehyde to nontoxic glycine betaine, an 
osmoprotectant (28). Also, several other 
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strategies have been proposed to eliminate antibiotic-resistant genes from transgenic plants 
(29). 

Transgenic potato plants that synthesize a CTB-insulin fusion protein at levels of up 
to 0.1% of the total soluble tuber protein have been found to show a substantial reduction 
5 in pancreatic islet inflammation and a delay in the progression of clinical diabetes (3 0) . This 
may prove to be an effective clinical approach for prevention of spontaneous autoimmune 
diabetes. Since, increased CTB expression levels have been shown to be achievable via the 
chloroplast genome through this research, expression of a CTB-proinsulin fusion protein in 
the chloroplasts of edible tobacco (L AMD) is currently being tested in our laboratory. While 
10 existing expression levels of CTB via the chloroplast genome are adequate for commercial 
exploitation, levels can be increased further (about 10 fold) by insertion of a putative 
chaperonin, as in the case of the Bt Cry2aA2 operon, (13) which likely aids in the 
subsequent purification of recombinant CTB due to crystallization. 
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Figure 6: 12% reducing PAGE, Chemiluminescent 
detection with rabbit anti-cholera scrum (1°) and AP 
labeled mouse anti-rabbit IgG (2°) antibodies. 
Untransforrned, boiledfl) and unboiled (2); Transformed, 
boiled (3&5) and unboiled (4);Purified CTB boiled (6)and 
unboiled (7); Marker (8). 

« HSA Nuclear transformation of potato plants 
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Figure7; A, B) reducing gels. l:markers, 2:Transgenic 
extract showing expression of light (A) and heavy chain 
(B) in chloroplasts, 3: Untransforrned, 4: Human IgA. C) 
non-reducing gel. 1. Transgenic extract showing assembly, 
2: Untransformed, 3: Human IgA. Blots A & C were 
detected with AP conjugated goat anti-human kappa 
antibody. Blot B was detected with AP conjugated goat 
anti-human IgA antibody. 




40.00 
30.00 
20.00 J 
10.00 



Relative 
frequency 000 
(%) 



,J2L 



~ 30 



Figure 8: Western Blot of transgenic potato tubers, cv 
Desiree. 30 |j.g of tuber protein was loaded per lane and 
probed with anti-HSA antibody. 1 : wild type; 2: 40 ng of 
pure HSA; 3-8:diferent trangenic lines, showing 
different levels of expression. 
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Figure 9: Frecuency histogram including percentage 
Kennebec and Desiree transgenic plants expressing 
different HSA levels. Results are shown as the 
percentages of transgenic plants (vertical axis) that 
express a specific level of HSA of the total soluble 
protein (horizontal axis). 

Codon composition and expression levels- 



Figura 10: Western Blot of £. coli protein 
extracts. 1: 50 ng pure HSA; 2: molecular weigh 
marker; 3: pLD-HSA (control without RBS); 4: 
PLD- 5'UTR-HSA; 5: pLD-RBS-HSA; 6: pLD- 
ORF1+2-HSA; 7: E. coli without pLD vector. 
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Table 1: Unmodified native codon composition and expression 
levels observed in transgenic chloroplasts. See section d) for 
details of AT content, %psbA optimal codons and % of codons 
that match the cp tRNA pool. TSP: % total soluble protein 
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Fieure J: Cry2A protein concentration determined by ELISA in 
transgenic leaves. Note 100-fold increase in protein accumulation m 
the presence of the putative chaperomn, ORF2. 



- Expression of a small (22aa^ peptid e in transgenic chloroplasts 



Figure 2 :Inmunogotd labeled electron microscopy 
of mature transgenic leaf. Cry2Aa2 crystals in a 
transgenic chloroptast expressing the cry2A 
operon. 



Bioassay P. aeruginosa 




Transgenic Untransformed 
Fieure 3 . Leaves were infected with 10 ul of 8x10 s , 
8x10*, SxlO 3 and 8X10 2 cells of P. syringae. Photos 
were taken 5 days after inoculation. I; 2 ., J*P °£ 
antimicrobial peptide (AMP) is required to kill 1000 
bacterial cells. Local concentration at the site of 
infection is estimated to be 200-800pg AMP. 




Transgenic Wild type Buffer only 
Figure 4. Total plant protein was mixed with*5uJ of niid- 
lo<* phase bacteria from overnight culture, incubated for 2 
hours at25°C at 125rpm and grown in LB broth overnight 
Based on minimum inhibitory concentration of 1-2 ug 
AMP/1000 bacterial cells, the expression level was 
calculated to be 2 1 .5-43% of the total soluble protein. 
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fieure 5' A) CTB ELISA quatification is shown as a percentage of the total soluble P^lS^iJ 0 ^^ 
S nroteih from young mature and old leaves of transgenic lines 3 and 7 was quantified. B) CTB-GM1 
^l&r^"^ys= Ptotes coated first with GM1 gangliosides and BSA were plated with total 
Sffiit protein from lines 3 and 7, untransformed plant total soluble protein and purified bacterial CTB. The 
absorbance or the GM 1 ganglioside-CTB antibody complex was measured. 
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PRODUCTION OF HUMAN SERUM ALBUMIN 
IN TRANSGENIC TOBACCO 

5 FIELD OF THE INVENTION 

This invention relates to production of high value pharmaceutical proteins in nuclear 
transgenic plants, particularly to production of human serum albumin in transgenic tobacco. 

BACKGROUND 

1 0 Human serum albumin (HS A) is a monomelic globular protein consisting of a single, generally 

nonglycosylated, polypeptide chain of 585 amino acids (66.5 KDa and 17 disulfide bonds) with no 
postranslational modifications. It is composed of three structurally similar globular domains and the 
disulfides are positioned in repeated series of nine lpop-link-loop structures centered around eight 
sequential Cys-Cys pairs. HS A is initially synthesized as pre-pro-albumin by the liver and released 

1 5 from the endoplasmatic reticulum after removal of the aminoterminal prepeptide of 1 8 amino acids. 
The pro-albumin is further processed in the Golgi complex where the other 6 aminoterminal residues 
of the propeptide are cleaved by a serine proteinase (1). This results in the secretion of the mature 
polypeptide of 585 amino acids. HS A is encoded by two codominant autosomic allelic genes. HS A 
belongs to the multigene family of proteins that include alpha-fetoprotein and human group-specific 

20 component (Gc) or vitamin D-binding family. HS A facilitates transfer of many ligands across organ 
circulatory interfaces such as in the liver, intestine, kidney and brain. In addition to blood plasma, 
serum albumin is also found in tissues. HS A accounts for about 60% of the total protein in blood 
serum. The concentration of albumin is 40 mg/ml in the serum of human adults. 

25 Medical applications of HSA: The primary function of HSA is the maintenance of colloid osmotic 
pressure (COP) within the blood vessels. Its abundance makes it an important determinant of the 
pharmacokinetic behavior of many drugs. Reduced synthesis of HSA can be due to advanced liver 
disease, impaired intestinal absorption of nutrients or poor nutritional intake. Increased albumin 
losses can be due to kidney diseases (increased glomerular permeability to macromolecules in the 

30 nephrotic syndrome), intestinal diseases (protein-losing enteropathies) or exudative skin disorders 
(burns). Catabolic states such as chronic infections, sepsis, surgery, intestinal resection, trauma or 
extensive burns can also cause hypoalbuminemia. HSA is used in therapy ofblood volume disorders, 
for example posthaemorrhagic acute hypovolemia or extensive burns, treatment of dehydration 
states, and also for cirrhotic and hepatic illnesses. It is also used as an additive in perfusion liquid for 

3 5 extracorporeal circulation. HSA is used clinically for replacing blood volume, but also has a variety 
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of non-therapeutic uses, includingits role as a stabilizer in formulations for othertherapeuticproteins. 
HS A is a stabilizer for biological materials in nature and is used for preparing biological standards and 
reference materials. Furthermore, HSA is frequently used as an experimental antigen, a cell-culture 
constituent and a standard in clinical-chemistry tests. 

5 

Expression systems for HSA: The expression and purification of recombinant HSA from various 
microorganisms has been reported previously (2-6), Saccharomyces cerevisiae has been used to 
produce HSA both intracellulary, requiring denaturation and refolding prior to analysis (7), and by 
secretion (8). Secreted HSA was equivalent structurally, but the recombinant product had lower 

1 0 levels of expression (recovery) and structural heterogeneity compared to the blood derived protein 
(9). HSA was also expressed in Kluyveromyces lactis, a yeast with good secretary properties 
achieving 1 g/liter in fed batch cultures (10). Ohtani et al (1 1) developed a HSA expression system 
using Pichia pastoris and established apurification method obtaining recombinant protein with similar 
levels of purity and properties as the human protein. In Bacillus subtilis, HSA could be secreted 

15 using bacterial signal peptides (4). HSA production in E. co/z was successful but required additional 
in vitro processing with trypsin to yield the mature protein (3). Sijmons et al. (12) expressed HSA 
in transgenic potato and tobacco plants. Fusion of HSA to the plant PR-S presequence resulted in 
cleavage of the presequence at its natural site and secretion of correctly processed HSA, that was 
indistinguishable from the authentic human protein. The expression was 0.0 1 4% of the total soluble 

20 protein. However, none of these methods have been exploited commercially. 

Challenges in commercial production of HSA: Albumin is currently obtained by protein 
fractionation from plasma and is the world's most used intravenous protein, estimated at around 500 
metric tons per year. Albumin is typically administered by intravenous injection of solutions 

25 containing 20% of albumin. The average dosage of albumin for each patient varies between 20-40 
grams/day. The consumption of albumin is around 700 kilograms per million habitants per year. In 
addition to high cost, HSA has the risk of transmitting diseases as with other blood-derivative 
products. The price of albumin is about $3.7/g. Thus, the market of this protein approximately 
amounts to 0.7 billion dollars per year in USA. Because of the high cost of albumin, synthetic 

30 macromolecules (like dextrans) are used to increase plasma colloidosmotic pressure. 

Commercial HSA is mainly prepared from human plasma. This source hardly meets the 
requirements of the world market. The availability of human plasma is limited and careful heat 
treatment of the productprepared must be performed to avoid potential contamination of the product 
by hepatitis, HIV and other viruses. The costs of HSA extraction from blood are very high. 

35 Innovative production systems are needed to meet the demands of the large albumin market with a 
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safe product at a low cost. Plant biotechnology offers the promise of obtaining safe and cheap 
proteins to be used to treat human diseases, 

Chloroplast genetic engineering: When we developed the concept of chloroplast genetic 
5 engineering (13,14), it was possible to introduce isolated intact chloroplasts into protoplasts and 
regenerate transgenic plants (15). Therefore, early investigations on chloroplast trmsformation 
focused on the development of in organello systems using intact chloroplasts capable of efficient and 
prolonged transcription and translation (16 - 18) and expression of foreign genes in isolated 
chloroplasts (19). However, after the discovery of the gene gun as a transformation device (20), it 

10 was possible to transform plant chloroplasts without the use of isolated plastids and protoplasts. 
Chloroplast genetic engineering was accomplished in several phases. Transient expression of foreign 
genes in plastids of dicots (21,22) was followed by such studies in monocots (23). Unique to the 
chloroplast genetic engineering is the development of a foreign gene expression system using 
autonomously replicating chloroplast expression vectors (21). Stable integration of a selectable 

15 marker gene into the tobacco chloroplast genome (24) was also accomplished using the gene gun. 
However, useful genes conferring valuable traits via chloroplast genetic engineering have been 
demonstrated only recently. For example, plants resistant to B.t. sensitive insects were obtained by 
integrating the crylAc gene into the tobacco chloroplast genome (25). Plants resistant to B.t 
resistant insects (up to 40,000 fold) were obtained by hyper-expression of the cry2A gene within the 

20 tobacco chloroplast genome (26). Plants have also been genetically engineered via the chloroplast 
genome to confer herbicide resistance and the introduced foreign genes were maternally inherited, 
overcoming the problem of out-cross with weeds (27). Chloroplast genetic engineering technology 
is currently being applied to other useful crops (14,28). 

25 Investigations in progress; A remarkable feature of chloroplast genetic engineering is the 
observation of exceptionally large accumulation of foreign proteins in transgenic plants, as much as 
46% of CRY protein in total soluble protein, even in bleached old leaves (29, see attached report De 
Cosa et al. 2001). Stable expression of a pharmaceutical protein in chloroplasts was first reported 
for GVGVP, a protein based polymer with varied medical applications (such as the prevention of 

30 post-surgical adhesions and scars, wound coverings, artificial pericardia, tissue reconstruction and 
programmed drug delivery (30)). Subsequently, expression of the human somatotropin via the 
tobacco chloroplast genome (31) to high levels (7% of total soluble protein) was observed. The 
following investigations that are in progress in our laboratory illustrate the power of this technology 
to express small peptides, entire operons, vaccines that require oligomeric proteins with stable 

35 disulfide bridges and monoclonals that require assembly of heavy/light chains via chaperonins. 
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Engineering novel pathways via the chloroplast genome: In plant and animal cells, nuclear 
mRNAs are translated monocistronically. This poses a serious problem when engineering multiple 
genes in plants (32). Therefore, to express thepolyhydroxybutyrate polymer or Guy's 13 antibody, 
single genes were first introduced into individual transgenic plants. Then, these plants were back- 
5 crossed to reconstitute the entire pathway or the complete protein (33,34). Similarly, in a seven year 
long effort, Ye et al. (22) recently introduced a set of three genes for a short biosynthetic pathway 
that resulted in (3-carotene expression in rice. In contrast, most chloroplast genes of higher plants are 
cotranscribed (32). Expression of polycistrons via the chloroplast genome provides a unique 
opportunity to express entire pathways in a single transformation event. We have recently used the 

1 0 Bacillus thuringiensis (B t) cry2Aa2 operon as a model system to demonstrate operon expression and 
crystalformationviathechloroplastgenome(29). Cry2Aa2 is the distal gene ofa three-gene operon. 
The ^immediately upstream of ctyl Aa2 codes for a putative chaperonin that facilitates the folding 
of crylAsOL (and other proteins) to form proteolytically stable cuboidal crystals (35). 

Therefore, the cry2Aa2 bacterial operon was expressed in tobacco chloroplasts to test the 

15 resultant transgenic plants for increased expression and improved persistence of the accumulated 
insecticidal protein(s). Stable foreign gene integration was confirmed by PGR and Southern blot 
analysis in T 0 and T t transgenic plants. Cry2Aa2 operon derived protein accumulated at 45.3% of 
the total soluble protein in mature leaves and remained stable even in oldbleached leaves (46 . 1 %)(see 
figure number 4 in attached article De Cosa et al. 2001 , 29). This is the highest level of foreign gene 

20 expression ever reported in transgenic plants. Exceedingly difficult to control insects (1 0-day old 
cotton bollworm, beetanny worm) were killed 1 00% after consuming transgenic leaves. Electron 
micrographs showed the presence of the insecticidal protein folded into cuboidal crystals similar in 
shape to Cry2Aa2 crystals observed in Bacillus thuringiensis (see figure number 6 in attached article 
De Cosaetal. 2001,29). 

25 In contrast to currently marketed transgenic plants with soluble CRY proteins, folded 

protoxin crystals are processed onlyby target insects that have alkaline gutpH. This approach should 
improve safety of Bt transgenic plants. Absence of insecticidal proteins in transgenic pollen 
eliminates toxicity to non-target insects via pollen. In addition to these environmentally friendly 
approaches, this observation should serve as a model system for large-scale production of foreign 

30 proteins within chloroplasts in a folded configuration enhancing their stability and facilitating single 
step purification. This is the first demonstration of expression of a bacterial operon in transgenic 
plants and opens the door to engineer novel pathways in plants in a single transformation event. 

Expressing small peptides via the chloroplast genome: It is common knowledge that the medical 
35 community has been fighting a vigorous battle against drug resistant pathogenic bacteria for years. 
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Cationic antibacterial peptides from mammals, amphibians and insects have gained more attention 
over the last decade (36). Key features of these cationic peptides are anet positive charge, an affinity 
for negatively-charged prokaryotic membrane phospholipids over neutral-charged eukaryotic 
membranes and the ability to form aggregates that disrupt the bacterial membrane (37). 

5 There are three maj or peptides with a-helical structures, cecropin from Hyalophora cecropia 

(giant silk moth), magainins from Xenopus laevis (African frog) and defensins from mammalian 
neutrophils. Magainin and its analogues have been studied as a broad-spectrum topical agent, a 
systemic antibiotic; a wound-healing stimulant; and an anticancer agent (38). We have recently 
observed that a synthetic lytic peptide (MSI-99, 22 amino acids) can be successfully expressed in 

1 0 tobacco chloroplast (39). The peptide retained its lytic activity against the phytopathogenic bacteria 
Pseudomonas syringae and multidrug resistant human pathogen, Pseudomonas aeruginosa. The 
anti-microbial peptide (AMP) used in this study was an amphipathic alpha-helix molecule that has an 
affinity for negatively charged phospholipids commonly found in the outer-membrane of bacteria. 
Upon contact with these membranes, individual peptides aggregate to form pores in the 

15 membrane, resulting in bacterial lysis. Because of the concentration dependent action of the AMP, 
it was expressed via the chloroplast genome to accomplish high dose delivery at the point of infection. 
PCR products and Southern blots confirmed chloroplast integration of the foreign genes and 
homoplasmy. Growth and development of the transgenic plants was unaffected by hyper-expression 
of the AMP within chloroplasts. In vitro assays with T 0 and T, plants confirmed that the AMP was 

20 expressed at high levels (21.5 to 43% of the total soluble protein) and retained biological activity 
against Pseudomonas syringae, a major plant pathogen. In situ assays resulted in intense areas of 
necrosis around the point of infection in control leaves, while transformed leaves showed no signs 
of necrosis (200-800 ug of AMP at the site of infection) as shown in Fig. 1 . T l in vitro assays against 
Pseudomonas aeruginosa (a multi-drug resistant human pathogen) displayed a 96% inhibition of 

25 growth as shown in Fig. 2. These results give a new option in the battle against phytopathogenic and 
drug-resistant human pathogenic bacteria. Small peptides (like insulin) are degraded in most 
organisms. However, stability of this AMP in chloroplasts opens up this compartment for expression 
of hormones and other small peptides. 

30 Expression of cholera toxin P subunit oligomers as a vaccine in chloroplasts: Vibrio choleras, 
which causes acute watery diarrhea by colonizing the small intestine and producing the enterotoxin, 
cholera toxin (CT). Cholera toxin is a hexameric AB 5 protein consisting of one toxic 27kDa A 
subunit having ADP ribosyi transferase activity and a nontoxic pentamer of 1 1.6 kDa B subunits 
(CTB) that binds to the A subunit and facilitates its entry into the intestinal epithelial cells. CTB 

35 when administered orally (40) is a potent mucosal immunogen which can neutralize the toxicity of 
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the CT holotoxin by preventing it from binding to the intestinal cells (41). This is believed to be a 
result of it binding to eukaryotic cell surfaces via the G M1 gangliosides, receptors present on the 
intestinal epithelial surface, thus eliciting a mucosal immune response to pathogens (42) and 
enhancing the immune response when chemically coupled to other antigens (43 - 46). 
5 Cholera toxin (CTB) has previously been expressed in nuclear transgenic plants at levels of 

0.01 (leaves) to 0.3% (tubers) of the total soluble protein. To increase expression levels, we 
engineered the chloroplast genome to express the CTB gene (47). We observed expression of 
oligomeric CTB at levels of 4-5% of total soluble plant protein as shown in Fig. 3 A. PCR and 
Southern Blot analyses confirmed stable integration of the CTB gene into the chloroplast genome. 

1 0 Western blot analysis showed that transgenic chloroplast expressed CTB was antigenically identical 
to commercially available purified CTB antigen as shown in Fig. 4. Also, G M1 -ganglioside binding 
assays confirm that chloroplast synthesized CTB binds to the intestinal membrane receptor of cholera 
toxin as shown in Fig. 3B. Transgenic tobacco plants were morphologically indistinguishable from 
untransformed plants and the introduced gene was found to be stably inherited in the subsequent 

15 generation as confirmed by PCR and Southern Blot analyses. The increased production of an 
efficient transmucosal carrier molecule and delivery system, like CTB, in chloroplasts of plants makes 
plant based oral vaccines and fusion proteins with CTB needing oral administration, a much more 
feasible approach. This also establishes unequivocally that chloroplasts are capable of forming 
disulfide bridges to assemble foreign proteins. 

20 

Expression and assembly of monoclonals in transgenic chloroplasts: Dental caries (cavities) is 
probably the most prevalent disease of humankind. Colonization of teeth by S. mutans is the single 
most important risk factor in the development of dental caries. S. mutans is a non-motile, gram 
positive coccus. It colonizes tooth surfaces and synthesizes ghicans (insoluble polysaccharide) and 

25 fructans from sucrose using the enzymes glucosyltransferase and fructosyltransferase respectively 
(48). The glucans play an important role by allowing the bacterium to adhere to the smooth tooth 
surfaces. After its adherence, the bacterium ferments sucrose and produces lactic acid. Lactic acid 
dissolves the minerals of the tooth, producing a cavity. 

A topical monoclonal antibody therapy to prevent adherence of S. mutans to teeth has 

30 recently been developed. The incidence of cariogenic bacteria (in humans and animals) and dental 
caries (in animals) was dramatically reduced for periods of up to two years after the cessation of the 
antibody therapy. No adverse events were detected either in the exposed animals or in human 
volunteers (49). The annual requirement for this antibody in the US alone may eventually exceed 1 
metric ton. Therefore, this antibody was expressed via the chloroplast genome to achieve higher 

35 levels of expression and proper folding (50). The integration of antibody genes into the chloroplast 



WO 01/72959 



74 



PCT/US01/06288 



genome was confirmed by PCR and Southern blot analysis. The expression of both heavy and light 
chains was confirmed by western blot analysis under reducing conditions as shown in Figs. 5 A and 
B. The expression of fully assembled antibody was confirmed by western blot analysis under non- 
reducing conditions as shown in Fig. 5C. This is the first report of successful assembly of a multi- 
5 subunit human protein in transgenic chloroplasts. Production of monoclonal antibodies at agricultural 
level should reduce their cost and create new applications of monoclonal antibodies. 

Significance: Medical molecular pharming in transgenic plants has been reviewed recently (51). 
Since the demand for cheap and safe sources of HSA is expected to increase considerably in the 

1 0 coming years, it would be wis e to ensure that in the future this protein will be available in significantly 
larger amounts, preferably on a cost-effective basis. Because most genes can be expressed in many 
different systems, it is essential to determine which system offers the most advantages for the 
manufacture of the recombinant protein. The ideal expression system is one that produces a 
maximum amount of safe, biologically active material at a minimum cost. The use of modified 

1 5 mammalian cells with recombinant DN A techniques has the advantage of resulting in products which 
are closely related to those of natural origin. However, culturing these cells is intricate and can only 
be carried out on limited scale. 

The use of microorganisms such as bacteria permits manufacture on a larger scale, but 
introduces the disadvantage of producing products, which differ appreciably from the products of 

20 natural origin. For example, proteins that are usually glycosylated in humans are not glycosylated by 
bacteria. Furthermore, human proteins that are expressed at high levels in E. coli frequently acquire 
an unnatural conformation, accompanied by intracellular precipitation due to lack of proper folding 
and disulfide bridges. Production of recombinant proteins in plants has many potential advantages 
forgeneratingbiopharmaceuticalsrelevantto clinical medicine. These include the following: (i)plant 

25 systems are more economical than industrial focilities using fermentation systems; (ii) technology is 
available for harvesting and processingplants/plantproducts on a large scale; (iii) elimination of the 
purification requirement when the plant tissue containing the recombinant protein is used as a food 
(edible vaccines); (iv) plants can be directed to target proteins into stable, intracellular compartments 
as chloroplasts, or expressed directly in chloroplasts; (v) the amount of recombinant product that can 

30 be produced approaches industrial-scale levels; and (vi) health risks due to contamination with 
potential human pathogens/toxins are minimized. 

It has been estimated that one tobacco plant should be able to produce more recombinant 
protein than a 300-liter fermenter of E. coli. In addition, a tobacco plant produces a million seeds, 
facilitating large-scale production. Tobacco is also an ideal choice because of its relative ease of 

35 genetic manipulation and an impending need to explore alternate uses for this hazardous crop. 
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However, with the exception of enzymes (e.g. phytase), levels of foreign proteins produced in nuclear 
transgenic plants are generally low, mostly less than 1% of the total soluble protein (52). May et al. 
(53) discuss this problem using the following examples. Although plant derivedrecombinant hepatitis 
B surface antigen was as effective as a commercial recombinant vaccine, the levels of expression in 
5 transgenic tobacco were low (0.0066% of total soluble protein). Even though Norwalk virus capsid 
protein expressed in potatoes caused oral immunization when consumed as food (edible vaccine), 
expression levels were low (0.3% of total soluble protein). In particular, expression of human 
proteins in nuclear transgenic plants has been disappointingly low: e.g. human Interferon-p 
0.000017% of fresh weight, human serum albuminO.02% and erythropoietin 0.0026% of total soluble 

10 protein (see Tablel in ref. 52). A synthetic gene coding for the human epidermal growth factor was 
expressed only up to 0.001% of total soluble protein in transgenic tobacco (53). 

The cost of producing recombinant proteins in alfalfa leaves was estimated to be 12-fold 
lower than in potato tubers and comparable with seeds (52). However, tobacco leaves are much 
larger and have much higher biomass than alfalfa. The cost of production of recombinant proteins 

1 5 will be 50-fold lower than that of E. coli fermentation (with 20% expression levels, 52). A decrease 
in insulin expression from 20% to 5% of biomass doubled the cost of production (54). Expression 
level less than 1% of total soluble protein in plants has been found to be not commercially feasible 
(52). Therefore, it is important to increase levels of expression of recombinant proteins in plants to 
exploit plant production of pharmacologically important proteins. 

20 An alternate approach is to express foreign proteins in chloroplasts of higher plants . We have 

recently integrated foreign genes (up to 1 0,000 copies per cell) into the tobacco chloroplast genome 
resulting in accumulation of recombinant proteins up to 46% of the total cellular protein (29). 
Chloroplast transformation utilizes two flanking sequences that, through homologous recombination, 
insert foreign DNA into the spacer region between the functional genes of the chloroplast genome, 

25 thus targeting the foreign genes to a precise location. This eliminates the "position effect" and gene . 
silencing frequently observed in nuclear transgenic plants. Chloroplast genetic engineering is an 
environmentally friendly approach, minimizing concerns of out-cross of introduced traits via pollen 
to weeds or other crops. Most importantly, a significant advantage in the production of 
pharmaceutical proteins in chloroplasts is their ability to process eukaryotic proteins, including folding 

30 and formation of disulfide bridges (55). Chaperonin proteins are present in chloroplasts (56,57) that 
function in folding and assembly of prokaryotic/eukaryotic proteins. Also, proteins are activated by 
disulfide bond oxido/reduction cycles using the chloroplast thioredoxin system (58) or chloroplast 
protein disulfide isomerase (59) . Accumulation of fully assembled, disulfide bonded form of human 
somatotropin via chloroplast transformation (31) and oligomeric form of CTB (47) and assembly of 

35 heavy and light chains of humanized Guy's 13 antibody in transgenic chloroplasts (50) provide strong 
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evidence for successful processing of pharmaceutical proteins inside chloroplasts. Such folding and 
assembly should eliminate the need for highly expensive in vitro processing of pharmaceutical 
proteins. For example, 60% of the total operating cost in the production of human insulin is 
associated with in vitro processing (formation of disufide bridges and cleavage of methionine)(54). 

5 

Taken together, low levels of expression of human proteins in nuclear transgenic plants, and 
difficulty in folding, assembly/processing of human proteins in E.coli should make chloroplasts an 
ideal compartment for expression of recombinant proteins. Production of human proteins in 
transgenic chloroplasts also dramatically lowers the production cost. Large-scale production of 
10 human serum albumin in plants is a powerful approach to provide safe treatment to patients at an 
affordable cost and provide tobacco farmers alternate uses for this hazardous crop. Therefore, it 
would be highly advantageous to provide for expression of human serum albumin in transgenic 
tobacco chloroplasts to increase levels of expression and accomplish in vivo processing. 

1 5 SUMMARY OF THE INVENTION 

This invention synthesizes high value pharmaceutical proteins in nuclear transgenic plants by 
chloroplast expression for pharmaceutical protein production. Chloroplasts are suitable for this 
purpose because of their ability to process eukaryotic proteins, including folding and formation of 
disulfide bridges, thereby eliminating the need for expensive post-purification processing. Tobacco 

20 is an ideal choice for this purpose because of its large biomass, ease of scale-up (million seeds per 
plant) and genetic manipulation. We use poly(GVGVP) as a fusion protein to enable hyper- 
expression of human serum albumin and accomplish rapid one step purification of fusion peptides 
utilizing the inverse temperature transition properties of this polymer. We also use human serum 
albumin-CTB fusion protein in chloroplasts of nicotine free edible tobacco (LAMD 605) for oral 

25 delivery to NOD mice. 

BRIEF DESCRIPTION OF DRAWINGS 
Fig. 1 shows apair of photographs of leaves infected with 10 iA of 8xl0 5 , 8xl0 4 , 8xl0 3 and 
8X10 2 cells of P. syringae, taken 5 days after inoculation. 
30 Fig. 2 is a graph of absorbance of 600 nm of total plant protein mixed with 5 iA of mid-log 

phase bacteria from overnight culture, incubated for two hours at 25 °C at 1 25 rpm and grown in LB 
broth overnight. 

Fig. 3 A is a graph of CTB ELISA quantification shown as percentage of the total soluble 
plant protein. 

35 Fig. 3B is a graph of CTB-GM1 ganglioside binding ELISA assay of plates coated first with 
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GM1 gangliosides and BSA were plated with total soluble plant protein. 

Fig. 4 is a photograph of a 12% reducing PAGE of expression of CTB oligomers. 

Figs. 5A and B show photographs of reducing gels of expression and assembly of disulfide 
bonded Guy's 13 monoclonal antibody. 
5 Fig. SCisa photograph of a non-reducing gel. 

Fig. 6 is a photograph of a Western Blot of expression of HS A via nuclear genome in potato. 

Fig. 7 is a pair of frequency histograms including percentage Kennebec andDesiree transgenic 
plants expressing different HSA levels. 
10 Fig. 8 is a photograph of a Western Blot of expression of HSA by chloroplat vectors in E. 

coli. 

Fig. 9 is a photograph of a Western Blot of expression of HSA via chloroplast genome in 
tobacco. 

Fig. 1 OA is a map of the pLD chloroplast transformation vector and primer landing sites. 
1 5 Fig. 10B is a photograph of an Agarose gel containing PCR products using total plant DNA 

as template from transformed plants. 

DETAILED DESCRIPTION 
Expression of HSA via nuclear genome in potato: Recently, our collaborators in Spain cloned the 

20 human HSA cDNA from human liver cells and fused the patatin promoter (whose expression is tuber 
specific (60)) along with the leader sequence of PIN H (proteinase H inhibitor potato transit peptide 
that directs HSA to the apoplast (61)). Leaf discs of Desiree and Kennebec potato plants were 
transformed using Agrobacterium tumefaciens. A total of 98 transgenic Desiree clones and 30 
Kennebec clones were tested by PCR and western blots. Westernblots showed that the recombinant 

25 albumin (rHSA) had been properly cleaved by the proteinase II inhibitor transit peptide in Fig. 6. 
Expression levels of both cultivars were very different among all transgenic clones as expected as 
shown in Fig. 7, probably because of position effects and gene silencing (62,63). The population 
distribution was similar in both cultivars: the majority of transgenic clones showed expression levels 
between 0.04 and 0.06% of rHSA in the total soluble protein. The maximum recombinant HSA 

30 amount expressed was 0.2%. Between one and five T-DNA insertions per tetraploid genome were 
observed in these clones. Plants with higher protein expression were always clones with several 
copies of the HSA gene. Levels ofmRNA were analyzed by Northern blots. There was a correlation 
between transcript levels andrccombinant albumin accumulation in transgenic tubers. TheN-terminal 
sequence showed proper cleavage of the transit peptide and the amino terminal sequence between 

35 recombinant and human HSA was identical. Inhibition of patatin expression using the antisense 
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technology did not improve the amount of rHSA. Average expression level among 29 anti sense 
transgenic plants was 0.032% of total soluble protein, with a maximum expression of 0.1%. The 
maximum HSA expression level observed was 5-10 times more than that reported by Sijmons et al. 
(12). However, higher levels are needed to make plant derived HSA production commercially 
5 feasible. 

Chloroplast expression of HSA: We have also initiated transformation of the tobacco chloroplast 
genome for hyperexpression of HSA, which is a new technology that has reported the highest 
expression levels in plants (29). The HSA codon composition is advantageous for chloroplast 
1 0 expression and no changes in the nucleotide sequence were needed. pLD vector was used for ail the 
constructs. We designed several vectors to optimize HSA expression. All these contain ATG as the 
first amino acid of the mature protein. 

1 - RBS-ATG-HSA : The first vector includes the gene that codes for the mature HSA plus an 
additional ATG as a translation initiation codon. We included the ATG in one of the primers of the 
15 PCR, 5 nucleotides downstream of the chloroplast preferred RBS sequence GGAGG. The cDNA 
sequence of the mature HSA was used as template. The PGR product was cloned into PCR 2.1 
vector, excised as an EcoRI-NotI fragment and introduced into the pLD vector. 

2- 5 T UTRpsbA-ATG-HSA : The 200 bp tobacco chloroplast DNA fragment containing the 5 ' psb A 
UTR (untranslated region) was amplified using PCR and tobacco DNA as template. The fragment 

20 was cloned into PCR 2.1 vector, excised EcoRI-Ncol fragment was inserted at the Ncol site of the 
ATG-HSA and finally inserted into the pLD vector as an EcoRI-NotI fragment downstream of the 
16S rRNA promoter to enhance translation of the protein. 

3- BtORFl+2-ATG-HSA : ORF1 and ORF2 of the Bt Cry2Aa2 operon were amplified in a PCR 
using the complete operon as a template. The fragment was cloned into PCR 2. 1 vector, excised as 

25 an EcoRI-EcoRV fragment, inserted at EcoRV site with the ATG-HSA sequence and introduced into 
the pLD vector as an EcoRJ-NotI fragment. The ORF1 and ORP2 were fused upstream of the ATG- 
HSA. 

4- BtORFlH-2-5 , UTRpsbA -ATG-HSA : The 5'UTRpsbA was introduced in the vector number 3 
upstream of the HSA in the EcoRV-Ncol site. 

30 Expression of chloroplast vectors was first tested in E.coli before their use in tobacco 

transformation because of the similarity of protein synthetic machinery (64). Different levels of. 
expression were obtained in E. coli depending on the construct as shown in Fig. 8. Using the psbA 
5 , UTR and the ORF1 and ORF2 of the cry2Aa2 operon, we obtained higher levels of expression than 
using only the RBS. We observed in previous experiments that HSA in E. coli is completely 

35 insoluble, probably due to an improper folding resulting from the absence of disulfide bonds. This 
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is the reason why the protein is precipitated in the gel as shown in Fig. 8 . Different polypeptide sizes 
were observed, probably due to incomplete translation. Assuming that E. coli and chloroplast have 
similar protein synthesis machinery, one could expect different levels of expression in transgenic 
tobacco chloroplasts depending on the regulatory sequences, with the advantage that disulfide bonds 
5 are formed in chloroplasts (31). These four vectors were bombarded into tobacco leaves via particle 
bombardment (65) and after 4 weeks small shoots appeared as a result of independent transformation 
events . They all were tested by PCR to check integration in the chloroplast genome as shown in Figs . 
10A and B. The positive clones were transferred to pots. Transgenic leaves analyzed by western 
blots showed different levels of expression depending on the 5* region used in the transfromation 
1 0 vector. Maximum levels were observed in the plants transformed with the HS A preceded by the 5' 
UTR of the psbA gene as shown in Fig. 9. Quantification of the HS A and molecular analysis of these 
transformants are in progress. 

1) Evaluation of chloroplast gene expression: A systematic approach to identify and 

15 overcome potential limitations of foreign gene expression in chloroplasts of transgenic plants is 
essential. Information gained in this study should increase the utility of chloroplast transformation 
system by scientists interested in expressing other foreign proteins. Therefore, it is important to 
systematically analyze transcription, RNA abundance, RNA stability, rate of protein synthesis and 
degradation, proper folding and activity. For example, the rate of transcription of the introduced 

20 HS A gene will be compared with the highly expressing endogenous chloroplast genes (rbcL, psbA, 
1 6S rRNA), using run on transcription assays to determine if the 16SrRNA promoter is operating as 
expected. Transgenic chloroplast containing each of the constructs with different 5' regions (see 
preliminary studies) will be investigated to test their transcription efficiency. Similarly, transgene 
RNA levels will be monitored by northerns, dot blots and primer extension relative to endogenous 

25 rbcL, 16S rRNA or psbA. These results, along with run on transcription assays, should provide 
valuable information of RNA stability, processing, etc. With our past experience in expression of 
several foreign genes, RNA appears to be extremely stable based on northern blot analysis. However, 
a systematic study would be valuable to advance utility of this system by other scientists. Most 
importantly, the efficiency of translation will be tested in isolated chloroplasts and compared with the 

30 highly translated chloroplastprotein (psbA). Pulse chase experiments will help assess if translational 
pausing, premature termination occurs. Evaluation of percent RNA loaded on polysomes or in 
constructs with or without 5 XJTRs will help determine the efficiency of the ribosome binding site and 
5* stem-loop translational enhancers. In our recent experience, we observed a 200-fold difference in 
accumulation of foreign proteins due to decreases in proteolysis conferred by a putative chaperonin 

35 (29). Therefore, proteins from constructs expressing or not expressing the putative chaperxmin (with 
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or without ORF1+2) should provide valuable information on protein stability* Thus, this information 
will be used to improve the next generation of chloroplast vectors. 

2) Expression of the mature protein: HS A is a pre-protein that is cleaved in the N-terminal 
5 to secrete the mature protein. The codon for translation initiation is in the presequence. In 
chloroplasts, the necessity of expressing the mature protein introduces this additional amino acid in 
the coding sequence. The sequence of the mature protein is first subcloned beginning with an ATG 
to optimize expression levels. Subsequent immunological assays in mice are performed with the 
protein to investigate if the extra-methionine can cause immunogenic response or low bioactivity. 
10 Alternatively, different systems can produce the mature protein. These systems can include the 
synthesis of a protein fused to a peptide that is cleaved intracellulary (processed) by chloroplast 
enzymes or the use of chemical or enzymatic cleavage after partial purification of proteins from plant 
cells. 

15 Use of peptides that are cleaved in chloroplast: Staub et al. (31) reported chloroplast expression 
of human somatotropin similar to the native human protein by using ubiquirin fusions that were 
cleaved in the stroma by an ubiquitin protease. However, the processing efficiency ranged from 30 - 
80% and the cleavage site was not accurate. To process chloroplast expressed proteins a peptide 
which is cleaved in the stroma is essential. The transit peptide sequence of the RuBisCo (ribulose 

20 1 5 5-bisphosphate carboxylase) small subunit is an advantageous choice. This transit peptide has been 
studied in depth (66). RuBisCo is one of the proteins that is synthesized in cytoplasm and transported 
postranslationally into the chloroplast in an energy dependent process. The transit peptide is 
proteolyticallyremoved upon transportin the stromabythe stromal processingpeptidase (67). There 
are several sequences described for different species (68). A transit peptide consensus sequence for 

25 the RuBisCo small subunit of vascular plants is published by Keegstra et al. (69). The amino acids 
that are proximal to theC-terminal (41 - 59) are highly conserved in the higher plant transit sequences 
and belong to the domain which is involved in enzymatic cleavage (66). The RuBisCo small subunit 
transit peptide has been fused with various marker proteins (69,70), even with animal proteins 
(71,72), to target proteins to the chloroplast. 

30 Prior to transformation studies, cleavage efficiency and accuracy is tested by in vitro 

translation of the fusion protein and in organello import studies using intact chloroplasts. Once the 
correct fusion sequence for producing the mature protein is known, such sequence encoding the 
amino terminal portion of tobacco chloroplast transit peptide is linked with the mature sequence of 
the protein. Codon composition of the tobacco RuBisCo small subunit transit peptide appears to be 

35 compatible with chloroplast optimal translation (see section 3 and Table on page 13). Additional 
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transit peptide sequences for targeting and cleavage in the chloroplast have been described (66). In 
cases where the RuBisCo small subunit transit peptide is not suitable, other transit peptides with 
cleavage in stroma can be used. The lumen of thylakoids is a good target because thylakoids are easy 
to purify. It is relatively easy to free lumenal proteins either by sonication or with a very low triton 
5 XI 00 concentration. However, this often requires insertion of additional amino acid sequences for 
efficient import (66). 

Use of chemical or enzymatic cleavage: The strategy of fusing a protein to a tag with affinity for 
a certain ligand has been used to enable affinity purification of recombinant products (73 - 75). 

10 However, scale up of this technology is usually quite expensive. Avast number of cleavage methods, 
both chemical and enzymatic, have been investigated for this purpose (75). Chemical cleavage 
methods have low specificity and the relatively harsh cleavage conditions can result in chemical 
modifications of thereleased products (75). Some of the enzymatic methods offer significantlyhigher 
cleavage specificities together with high efficiency, e. g. H64A subtilisin, IgA protease and factor Xa 

15 (74,75), but these enzymes have the drawback of being quite expensive. 

Trypsin, which cleaves C-terminal of basic amino-acid residues, has been used for a long time 
to cleave fusion proteins (3,76). Despite expected low specificity, trypsin has been shown to be 
useful for specific cleavage of fusion proteins, leaving basic residues within folded protein domains 
uncleavaged (76), The use of trypsin only requires that the N-terminus of the mature protein be 

20 accessible to the protease and that the potential internal sites are protected in the native conformation. 
Trypsin has the additional advantage ofbeing inexpensive and readily available. In the case of HSA, 
when it was expressed in E, coli with 6 additional codons coding for a trypsin cleavage site, HSA was 
processed successfully into the mature protein after treatment with the protease. In addition, the N- 
terminal sequence was found to be unique and identical to the sequence of natural HSA, the 

25 conversion was complete and no degradation products were observed (3). This in vitro maturation 
is selective because correctly folded albumin is highly resistant to trypsin cleavage at inner sites (3). 
This system could be tested for chloroplasts HSA vectors using protein expressed in E. coli, 

Staub et al. (3 1) demonstrated that the chloroplast methionine aminopeptidase is active and 
they found 95% of removal of the first methionine of an ATG-somatotropin protein that was 

30 expressed via the chloroplast genome. There are several investigations that have shown a very strict 
pattern of cleavage by this peptidase (77). Methionine is only removed when second residues are 
glycine, alanine, serine, cysteine, threonine, proline or valine, but if the third amino acid is proline the 
cleavage is inhibited. For HSA the second aminoacid is aspartic acid, so the cleavage may not be 
possible. 
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3) Optimization of gene expression: We have reported that foreign genes are expressed 
between 3% {cry2Aa2) and 46% {cry2Aa2 operon) in transgenic chloroplasts (26,29). Based on the 
outcome of the evaluation of HSA chloroplast transgenic plants, several approaches will be used to 
enhance translation of the recombinant proteins. In chloroplasts, transcriptional regulation of gene 
expression is less important, although some modulations by light and developmental conditions are 
observed (78). RNA stability appears to be one among me least problems because of observation of 
excessive accumulation of foreign transcripts, attunes 16,966-fold higher than the highly expressing 
nuclear transgenic plants (79) . Chloroplast gene expression is regulated to a large extent at the post- 
transcriptional level. For example, 5' UTRs are necessary for optimal translation of chloroplast 
mRNAs. Shine-Dalgarno (GGAGG) sequences as well as a stem-loop structure located 5* adjacent 
to the SD sequence are required for efficient translation. A recent study has shown that insertion of 
the psbA 5' UTR downstream of the 16S rRNA promoter enhanced translation of a foreign gene 
(GUS) hundred-fold (80). Therefore, the 200-bp tobacco chloroplast DNA fragment (1680-1480) 
containing 5' psbA UTR is used. This PCR product is then inserted downstream of the 16S rRNA 
promoter to enhance translation of the recombinant proteins. 

Yet another approach for enhancement of translation can be the codon composition 
optimization. It is reasonable to expect efficient expression in chloroplasts since the protein is 
translated in E. colt Although rbcL (RuBisCO) is the most abundant protein on earth, it is not 
translated as highly as the psbA gene due to the extremely high turnover of the psbA gene product. 
The psbA gene is under stronger selection for increased translation efficiency and is the most 
abundant thylakoid protein. In addition, the codon usage in higher plant chloroplasts is biased 
towards the NNC codon of 2-fold degenerate groups (i.e. TTC over TTT, GAC over GAT, CAC 
over CAT, AAC over AAT, ATC over ATT, ATA etc.). This is in addition to a strong bias towards 
T at third position of 4-fold degenerate groups. There is also a context effect that should be taken 
into consideration while modifying specific codons. The 2-fold degenerate sites immediately 
upstream from a GNN codon do not show this bias towards NNC. (TTT GGA is preferred to TTC 
GGA while TTC CGT is preferred to TTT CGT 5 TTC AGT to TTT AGT and TTC TCT to TTT 
TCT)(81,82). 

hi addition, highly expressed chloroplast genes use GNN more frequently than other genes. 
The disclosure found in web site http://www.k azusa.or.tp/codon was used to analyze codon 
composition by comparing different species. Abundance of amino acids in chloroplasts and tRNA 
anticodons present in chloroplast was taken into consideration. We also compared A-KT% content 
of all foreign genes that had been expressed in transgenic chloroplasts in our laboratory with the 
percentage of chloroplast expression. We found that higher levels of A+T always correlated with 
high expression levels (see Table 1). The HSA sequence showed 57% of A+T content and 40% of 
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the total codons matched with the psbA most translated codons. According to the data of the Table 
and taking into consideration all these factors, good chloroplast expression of the HS A gene without 
modifications in its codon composition can be expected. 
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4) Vector constructions: pLD vector is used for the constructs. This vector was developed 
for chloroplast transformation. It contains the 16S rRNA promoter (Prm) driving the selectable 
marker gene aadA (aminoglycoside adenyl transferase conferring resistance to spectinomycm) 
followed by the psbA 3' region (the terminator from a gene coding for photosystem II reaction center 

20 components) from the tobacco chloroplast genome. The pLD vector is a universal chloroplast 
expression /integration vector and can be used to transform chloroplast genomes of several other 
plant species (14,27) because these flanking sequences are highly conserved among higher plants. 
The universal vector uses tmA and trnl genes (chloroplast transfer RNAs coding for Alanine and 
Isoleucine) from the inverted repeat region of the tobacco chloroplast genome as flanking sequences 

25 for homologous recombination. Because the universal vector integrates foreign genes within the 
Inverted Repeat region of the chloroplast genome, it doubles the copy number of the transgene (from 
5000 to 1 0,000 copies per cell in tobacco). Furthermore, it has been demonstrated that homoplasmy 
is achieved even in the first round of selection in tobacco probably because of the presence of a 
chloroplast origin of replication within the flanking sequence in the universal vector (thereby 

30 providing more templates for integration). Because of these and several other reasons, foreign gene 
expression was shown to be much higher when the universal vector was used instead of the tobacco 
specific vector (30). 

The following vectors can be used to optimize protein expression, purification and production 
of HSA with the same amino acid composition as the human protein. 
35 a) We increase translation using the psbA 5 *UTR to optimize expression. The 200 bp tobacco 
chloroplast DNA fragment containing 5' psbA is amplified by PCRusing tobacco chloroplast . 
DNA as template. This fragment is cloned directly in the pLD vector multiple cloning site 
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(EcoRI-Ncol) downstream of the promoter and the aadA gene. The cloned sequence is the 
same as in the psbA gene. 

b) Forenhancingprotemstabilitya^ 

operon derived putative chaperonin is used. Expression of the cry2Aa2 operon in 
chloroplasts provides a model system for hyper-expression of foreign proteins (46% of total 
soluble protein) in a folded configuration enhancing their stability and f acilitatingpurification 
(29). This justifies inclusion of the putative chaperonin from the cry2Aa2 operon in one of 
the newly designed constructs. In this region there are two open reading frames (ORF1 and 
ORF2) and a ribosomal binding site (rbs). This sequence contains elements necessary for 
Cry2Aa2 crystallization which help to fold or crystallize the HSA protein helping in the 
subsequent purification. Successful crystallization of other proteins using this putative 
chaperonin has been demonstrated (35). We amplify the ORF1 andORP2oftheBt Cry2Aa2 
operon by PCR using the complete operon as template. The fragment is cloned into a PCR 
2.1 vector and excised as an EcoRI-EcoRV product. This fragment is then cloned directly 
into the pLD vector multiple cloning site (EcoRI-EcoRV) downstream of the promoter and 
the aadA gene. 

c) To obtain HSA with the same amino acid composition as the mature human protein (without 
the extra methionine), we first fuse HSA with the RuBisCo small subunit transit peptide. 
Also, other constructions are performed to allow cleavage of the protein after isolation from 
chloroplast. 

The first set of constructs includes the sequence of HSA beginning with an ATG, introduced 
by PCR using primers. Once optimal expression levels are achieved, and when the ATG is shown 
to be a problem (determined by mice immunological assays), processing to produce the mature 
protein is addressed. The first attempt is the use of the RuBisCo small subunit transit peptide. This 
transit peptide is amplified by PCR using tobacco DNA as a template and cloned into the PCR 2. 1 
vector. The HSA gene is fused with the transit peptide using a Mlul restriction site that is introduced 
in the PCR primers for amplification of the transit peptide and the HSA coding sequence. The gene 
fusion is then inserted into the pLD vector downstream of the 5 'region that gives optimal expression 
of HSA (RBS S 5TJTRpsbA, ORF1+2, ORFl+2-5'UTRpsbA). Another approach to eliminate the 
ATG of the coding region is the use of the ATG before a protease recognition sequence, like trypsin, 
and remove in vitro such extra sequence to obtain the mature protein. Such sequences will be 
introduced by primers in a PCR. After completing vector constructions, the vectors are sequenced 
to confirm correct nucleotide sequence and in frame fusion. DNA sequencing is performed using a 
Perkin Elmer ABI prism 373 DNA sequencing system or the like. 
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5 

Because of the similarity of protein synthetic machinery (64), expression of chloroplast 
vectors is first tested in E.coli before their use in tobacco transformation. For Escherichia coli 
expression XL-1 Blue strain is used. Purification and cleavage assays is performed using K coli 
1 0 expressed protein. 

5) Bombardment, Regeneration and Characterization of Chloroplast Transgenic Plants: 

Tobacco {Nicotiana tabacum var. Petit Havana) plants are grown aseptically by germination of seeds 
on MSO medium. This medium contains MS salts (4.3 g/liter) 5 B5 vitamin mixture (myo-inositol, 

15 100 mg/liter; thiamine-HCl, 10 mg/liter; nicotinic acid, 1 mg/liter; pyridoxine-HCl, 1 mg/liter), 
sucrose (30 g/liter) and phytagar (6 g/liter) at pH 5,8. Fully expanded, dark green leaves of about 
two month old plants are used for bombardment. 

Leaves are placed abaxial side up on a Whatman No. 1 filter paper laying on the RMOP 
medium (20) in standard petri plates (100x1 5 mm) for bombardment Gold(0.6 jim)microprojectiies 

20 are coated with plasmid DNA (chloroplast vectors) and bombardments are carried out with the 
biolistic device PDS 1 000/He (Bio-Rad) as describedby Daniell (65). Following bombardment, petri 
plates are sealed with parafilm and incubated at 24°C under 12 h photoperiod. Two days after 
bombardment, leaves are chopped into small pieces of -5 mm 2 in size and placed on the selection 
medium (RMOP containing 500 fig/ml of spectinomycin dihydrochloride) with abaxial side touching 

25 themednimindeep(100x25mm)petriplates (-10 pieces perplate). The regenerated spectinomycin 
resistant shoots are chopped into small pieces (—2mm 2 ) and subcloned into fresh deeppetri plates (-5 
pieces per plate) containing the same selection medium. Resistant shoots from the second culture 
cycle are transferred to the rooting medium (MSO medium and spectinomycin dihydrochloride, 500 
mg/liter). Rooted plants are transferred to soil and grown at 26°C under 16 hour photoperiod 

30 conditions for further analysis. 

PGR analysis of putative transformants: PCR is performed using DNA isolated from control and 
transgenic plants to distinguish a) true chloroplast transformants from mutants and b) chloroplast 
transformants from nuclear transformants. Primers for testing the presence of the aadA gene (that 
3 5 confers spectinomycin resistance) in transgenic plants are landed on the aadA coding sequence and 
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16S rRNA gene. One primer lands on the aadA gene while another lands on the native chloroplast 
genome as shown in Fig. 10A to test chloroplast integration of the genes. No PCR product is 
obtained with nuclear transgenic plants using this set of primers. The primer set is used to test 
integration of the entire gene cassette without any internal deletion or looping out during homologous 
5 recombination. A similar strategy has been used successfully by us to confirm chloroplast integration 
of foreign genes (26-30). This screening is essential to eliminate mutants and nuclear transformants. 
Total DNA from unbombarded and transgenic plants is isolated as described by Edwards et al. (83) 
to conduct PCR analyses in transgenic plants. Chloroplast transgenic plants containing the desired 
gene are moved to second round of selection to achieve homoplasmy. 

10 

Southern Analysis for homoplasmy and copy number: Southern blots are performed to determine 
the copy number of the introduced foreign gene per cell as well as to test homoplasmy. There are 
several thousand copies of the chloroplast genome present in each plant cell. Therefore, when foreign 
genes are inserted into the chloroplast genome, it is possible that some of the chloroplast genomes 

15 have foreign genes integrated while others remain as the wild type (heteroplasmy). Therefore, to 
ensure that only the transformed genome exists in cells of transgenic plants (homoplasmy), the 
selection process is continued Total DNA from transgenic plants are probed with the chloroplast 
border (flanking) sequences (the trnl-trnA fragment) to confirm that the wild type genome does not 
exist at the end of the selection cycle. If wild type genomes are present (heteroplasmy), the native 

20 fragment size is observed along with transformed genomes. Presence of a large fragment (due to 
insertion of foreign genes within the flanking sequences) and absence of the native small fragment 
confirms homoplasmy (26,27,30). 

The copy number of the integrated gene is determined by establishing homoplasmy for the 
transgenic chloroplast genome. Tobacco chloroplasts contain 5000-10,000 copies of their genome 

25 per cell (27). When only a fraction of the genomes are actually transformed, the copy number, by 
default, must be less than 10,000. By establishing that in the transgenics the gene inserted 
transformed genome is the only one present, it can be established that the copy number is about 
5000-10,000 per cell. This is usually done by digesting the total DNA with a suitable restriction 
enzyme and probing with the flanking sequences that enable homologous recombination into the 

30 chloroplast genome. The native fragment present in the control should be absent in the transgenics. 
The absence of native fragment proves that only the transgenic chloroplast genome is present in the 
cell and there is no native, untransformed, chloroplast genome, without the foreign gene present. 
This establishes the homoplasmic nature of our transformants, simultaneously providing us with an 
estimate of about 5000-10,000 copies of the foreign genes per cell. 

35 
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Northern Analysis for transcript stability: Northern blots are performed to test the efficiency of 
transcription of the genes. Total RNA is isolated from 150mgoffrozenleavesbyusingthe"Rneasy 
Plant Total RNA Isolation Kit" (Qiagen Inc., Chatsworth, CA). RNA (10^0 ng) is denatured by 
formaldehyde treatment, separated on a 1,2% agarose gel in the presence of formaldehyde and 
transferred to a nitrocellulose membrane (MSI) as described in Sambrook et al. (84). Probe DNA 
(HSA gene coding region) is labeled by the random-primed method (Promega) with 32 P-dCTP 
isotope. The blot is pre-hybridized, hybridized and washed as described above for southern blot 
analysis. Transcript levels are quantified by the Molecular Analyst Program using the GS-700 
Imaging Densitometer (Bio-Rad, Hercules, CA) or the like. 

Expression and quantification of the total protein expressed in chloroplast: Chloroplast 
expression assays are performed by Western Blot. Recombinant protein levels in transgenic plants 
of first and second generation (To and Tl) are determined using quantitative ELISA assays. A 
standard curve is generated using known concentrations and serial dilutions of recombinant and native 
proteins. Different tissues are analyzed using young, mature and old leaves against goat anti-HSA 
(Nordic Immunology) antibodies. BoundlgG is measured using horseradishperoxidase-labelled anti- 
goat IgG (Sigma). 

Inheritance of Introduced Foreign Genes: While it is unlikely that introduced DNA moves from 
the chloroplast genome to nuclear genome, it is possible that the gene can be integrated in the nuclear 
genome during bombardment and remain undetected in Southern analysis. Therefore, in initial 
tobacco transformants, some is allowed to self-pollinate, whereas others are used in reciprocal crosses 
with control tobacco (transgenics as female accepters and pollen donors; testing for maternal 
inheritance). Harvested seeds (Tl) are germinated on media containing spectinomycin. Achievement 
of homoplasmy and mode of inheritance can be classified by looking at germination results. 
Homoplasmy can be indicated by totally green seedlings (27) while heteroplasmy is displayed by 
variegated leaves flack of pigmentation, 24). Lack of variation in chlorophyll pigmentation among 
progeny also underscores the absence of position effect, an artifact of nuclear transformation. 
Maternal inheritance is demonstratedby sole transmission of introduced genes via seed generated on 
transgenic plants, regardless of pollen source (green seedlings on selective media). When transgenic 
pollen is used for pollination of control plants, resultant progeny do not contain resistance to chemical 
in selective media (will appear bleached; 24). Molecular analyses confirm transmission and 
expression of introduced genes, and T2 seed is generated from those confirmed plants by the 
analyses described above. 
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6) Purification methods: The standard method of purification employs classical biochemical 
techniques with the crystallized proteins inside the chloroplast. In this case, the homogenates are 
passed through miracloth to remove cell debris. Centrifugation at 10,000 xg pellets all foreign 
proteins (29). Proteins are solubilized using pH, temperature gradient, etc. This is possible if the 
ORF1 and 2 of the cry2Aa2 operon can fold and crystallize the recombinant protein. When there is 
no crystal formation, other purification methods are applied (classical biochemistry techniques). 
Albumin is typically administered in tens of gram quantities. At a purity level of 99.999% (a level 
considered sufficient for other recombinant protein preparations), recombinant HSA (rHSA) 
impurities on the order of one mg is still injected into patients. Hence, impurities from the host 
organism must be reduced to a minimum. Furthermore, purified rHSA must be identical to human 
HSA. Despite these stringent requirements, purification costs must be kept low. It is not appropriate 
to apply conventional processes for purifying HSA originating in plasma to purify the HSA obtained 
by gene manipulation. This is because the impurities to be eliminated from rHSA differ from those 
contained in the HSA originating in plasma. Namely, rHSA is contaminated with, for example, 
coloring matters characteristic to recombinant HSA 3 proteins originating in the host cells, 
polysaccharides, etc. In particular, it is necessary to sufficiently eliminate components originating in 
the host cells, since they are foreign matters for living organisms including human and can cause the 
problem of antigenicity. 

hi plants, two different methods of HSA purification have been performed at laboratory scale. 
Sijmons et aL (12) transformed potato and tobacco plants vriihAgrobacterium tumefaciens. For the 
extraction and purification of HSA, 1000 gof stem and leaf tissue was homogenized in 1000 ml cold 
PBS, 0.6% PVP, 0.1 mM PMSF and 1 mM EDTA. The homogenate was clarified by filtration, 
centrifuged and the supernatant incubated for 4 h with L5 ml polyclonal antiHSA coupled to 
Reactigel spheres (Pierce Chem) in the presence of 0.5% Tween 80. The complex HSA-anti HSA- 
Reactigel was collected and washed with 5 ml 0.5% Tween 80 in PBS. HSA was desorbed from the 
reactigel complex with 2.5 ml of 0.1 M glycine pH 2.5, 10% dioxane, immediately followed by a 
buffer exchange with Sephadex G25 to 50 mM Tris pH 8. The sample was then loaded on a HR5/5 
MonoQ anion exchange column (Pharmacia) and eluted with a linear NaCl gradient (0 - 350 mM 
NaCl in 50 mM Tris pH 8 in 20 min at lml/min). Fractions containing the concentrated HSA (at 290 
mM NaCl) were lyophilized and applied toaHR 10/30 Sepharose 6 column (Pharmacia) in PBS at 
0.3 ml/min. However, this method uses affinity columns (polyclonal anti-HSA) that are very 
expensive to scale-up. Also, the protein is released from the column with 0. 1M glycine pH 2.5 that 
typically denatures the protein. Therefore, this method can be suitably modified. 

The second method is used for HSA extraction and purification from potato tubers (Dr. 
Mingo-Casters laboratory). After grinding the tuber in phosphate buffer pH 7.4 (1 mg/2ml), the 
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homogenate is filtered in miracloth and centrifuged at 14,000 rpm 15 minutes. After this step, 
another filtration of the supernatant in 0.45 urn filters is necessary. Then, chromatography of ionic 
exchange in FPLC using a DEAE Sepharose Fast Flow column (Arnersham) is required. Fractions 
recovered are passed through an affinity column (Blue Sepharose fast flow Arnersham) resulting in 
a product of high purity. HSA purification based on both methods can then be investigated. 

7) Characterization of the recombinant proteins: For the safe use of recombinant proteins 
as a replacement in any of the current applications, these proteins must be structurally equivalent and 
must not contain abnormal host-derived modifications. To confirm compliance with these criteria 
human and recombinant proteins can be compared using the currently highly sensitive and highly 
resolving techniques expected by the regulatory authorities to characterize recombinant products 
(85). 

1- Amino acid analysis : N-terminal sequence analysis is performed by Edman degradation using 
ABI 477A protein sequencer with an on-line 120Aphenylthiohydantoin-arnino acid analyzer. 
Automated C-terminal sequence analysis uses a Hewlett-Packard Gl 009A protein sequencer. 
The C-terminal tryptic peptide is isolated from tryptic digests by reverse-phase HPLC to 
confirm the C-terminal sequence to a greater number of residues. 

2- Protein folding and disulfide bridges formation : Western blots with reducing and non- 
reducing gels is performed to check protein folding. Protein standards (Sigma) are loaded 
to compare the mobility of the recombinant protein. PAGE is performed on PhastGels 
(Pharmacia Biotech). Proteins are blotted and then probed with goat anti-HS A antibodies. 
Bound IgG is detected with horseradish peroxidase-labelled anti goat IgG and visualized on 
X-ray film using ECL detection reagents (Arnersham). 

3- Chromatographic techniques : For HSA, analytical gel-permeation HPLC is performed using 
a TSK G3000 SWxI column. Preparative gel permeation chromatography of HSA is 
performed using a Sephacryl S200 HR column. The monomer fraction, identified by 
absorbance at 280 nm, is dialyzed and reconcentrated to its starting concentration. 

4- Viscosity : This is a classical assay for recombinant HSA. Viscosity is a characteristic of 
proteins related directly to their size, shape, and conformation. The viscosities of HSA and 
recombinant HSA are measured at 100 mgMl-1 inO.lSMNaClusingaU-tubeviscosimeter 
(M2 type, Poulton, Selfe and Lee Ltd, Essex, UK) at 25°C. 

5- Glvcosvlation : Chloroplast proteins are not known to be glycosylated. However there are 
no publications to confirm or refute this assumption. Therefore, glycosylation will be 
measured using a scaled-up version of the method of Ahmed and Furth (86). 
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8) Animal testing and Pre-CHnical Trials: When albumin is produced at adequate levels in 
tobacco and the physicochemical properties of the product correspond to those of the natural protein, 
toxicology studies need to be done in mice. To avoid mice response to the human protein, transgenic 
mice carrying HSA genomic sequences is used (87). After injection of none, 1, 10, 50 and 100 mg 
of purified recombinant protein, classical toxicology studies are carried out (body weight and food 
intake, animal behavior, piloerection, etc.)- Albumin can be tested for blood volume replacement 
after paracentesis to eliminate the fluid from the peritoneal cavity in patients with liver cirrhosis. It 
has been shown that albumin infusion after this maneuver is essential to preserve effective circulatory 
volume and renal function (88). 
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EXPRESSION OF HUMAN THERAPEUTIC PROTEINS 
IN TRANSGENIC TOBACCO CHLOROPLASTS 

BACKGROUND OF THE INVENTION 
5 FIELD OF THE INVENTION 

The present invention is directed to the expression of genes in plants to produce 
recombinant proteins. 

DESCRIPTION OF RELATED ART 

Research on human proteins in the past years has revolutionized the use of these 

10 therapeutically valuable proteins in a variety of clinical situations. Since the demand for these 
proteins is expected to increase considerably in the coming years, it would be wise to ensure that 
in the future they will be available in significantly larger amounts, preferably on a cost-effective 
basis. Because most genes can be expressed in many different systems, it is essential to 
determine which system offers the most advantages for the manufacture of the recombinant 

15 protein. The ideal expression system would be one that produces a maximum amount of safe, 
biologically active material at a niinhnum cost. The use of modified mammalian cells with 
recombinant DNA techniques has the advantage of resulting in products which are closely 
related to those of natural origin; however, cuituring of these cells is intricate and can only be 
carried out on limited scale. The use of microorganisms such as bacteria permits manufacture on 

20 a larger scale, but introduces the disadvantage of producing products, which differ appreciably 
from the products of natural origin. For example, proteins that are usually glycosylated in 
humans are not glycosylated by bacteria. Furthermore, human proteins that are expressed at high 
levels in E. coli frequently acquire an unnatural conformation, accompanied by intracellular 
precipitation due to lack of proper folding and disulfide bridges. Production of recombinant 

25 proteins in plants has many potential advantages for generating biopharmaceuticals relevant to 
clinical medicine. These include the foUowing: (I) plant systems are more economical than 
industrial facilities using fermentation systems; (ii) technology is available for harvesting and 
processing plants/ plant products on a large scale; (iii) elimination of the purification requirement 
when the plant tissue containing the recombinant protein is used as a food (edible vaccines); (iv) 

30 plants can be directed to target proteins into stable, intracellular compartments as chloroplasts, or 
expressed directly in chloroplasts; (v) the amount of recombinant product that can be produced 
approaches industrial-scale levels; and (vi) health risks due to contarnination with potential 
human pathogens/toxins are rriinimized. 

It has been estimated that one tobacco plant should be able to produce more recombinant 

35 protein than a 300-liter fermenter of E. coli. In addition, a tobacco plant produces a rnilhon 
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jig large-scale production. Tobacco is also an ideal choice because of its relative 
ease of genetic manipulation and an impending need to explore alternate uses for this hazardous 
crop. However, with the exception of enzymes (e.g. phytase), levels of foreign proteins produced 
in nuclear transgenic plants are generally low, mostly less than 1% of the total soluble protein 
5 (1). May et al. (2a) discuss this problem using the following examples. Although plant derived 
recombinant hepatitis B surface antigen was as effective as a commercial recombinant vaccine, 
the levels of expression in transgenic tobacco were low (0.0066% of total soluble protein). Even 
though Norwalk virus capsid protein expressed in potatoes caused oral immunization when 
consumed as food (edible vaccine), expression levels were low (0.3% of total soluble protein). In 

10 particular, expression of human proteins in nuclear transgenic plants has been disappointingly 
low: e.g. human Interferon-D 0.000017% of fresh weight, human serum albumin 0.02% and 
erythropoietin 0.0026% of total soluble protein (see tablel in refl). A synthetic gene coding for 
the human epidermal growth factor was expressed only up to 0.001% of total soluble protein in 
transgenic tobacco (2a). The cost of producing recombinant proteins in alfalfa leaves was 

15 estimated to be 12-fold lower than in potato tubers and comparable with seeds (1). However, 
tobacco leaves are much larger and have much higher biomass than alfalfa. The cost of 
production of recombinant proteins will be 50-fold lower than that of E.coli fermentation (with 
20% expression levels, 1). A decrease in insulin expression from 20% to 5% of biomass doubled 
the cost of production (2b). Expression level less than 1% of total soluble protein in plants has 

20 been found to be not commercially feasible (1). Therefore, it is important to increase levels of 
expression of recombinant proteins in plants in order to exploit plant production of 
pharmacologically important proteins. 

An alternate approach is to express foreign proteins in chloroplasts of higher plants. We 
have recently integrated foreign genes (up to 10,000 copies per cell) into the tobacco chloroplast 

25 genome resulting in accumulation of recombinant proteins up to 47% of the total cellular protein 
(3). Chloroplast transformation utilizes two flanking sequences that, through homologous 
recombination, insert foreign DNA into the spacer region between the functional genes of the 
chloroplast genome, thus targeting the foreign genes to a precise location. This eliminates the 
"position effect" and gene silencing frequently observed in nuclear transgenic plants. 

30 Chloroplast genetic engineering is an environmentally friendly approach, nnniinizing concerns 
of out-cross of introduced traits via pollen to weeds or other crops. Also, the concerns of insects 
developing resistance to biopesticides are mirtimized by hyper-expression of single insecticidal 
proteins (high dosage) or expression of different types of insecticides in a single transformation 
event (gene pyramiding). Concerns of insecticidal proteins on non-target insects are rninimized 

35 by lack of expression in transgenic pollen. Most importantly, a significant advantage in the 
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pharmaceutical proteins in chloroplasts is their ability to process eukaryotic 
proteins, including folding and formation of disulfide bridges (4). Chaperonin proteins are 
present in chloroplasts (5,6) that function in folding and assembly of prokaryotic/eukaryotic 
proteins. Also, proteins are activated by disulfide bond oxido/reduction cycles using the 

5 chloroplast thioredoxin system (7) or chloroplast protein disulfide isomerase (8). Accumulation 
of fully assembled, disulfide bonded form of human somatotropin via chloroplast transformation 
(9) and oligomeric form of CTB (10) and assembly of heavy and light chains of hurrianized 
Guy's 13 antibody in transgenic chloroplasts (11) provide strong evidence for successful 
processing of pharmaceutical proteins inside chloroplasts. Such folding and assembly should 

10 eliminate the need for highly expensive in vitro processing of pharmaceutical proteins. For 
example, 60% of the total operating cost in the production of human insulin is associated with in 
vitro processing (formation of disufide bridges and cleavage of methionine)(2b). 

Taken together, low levels of expression of human proteins in nuclear transgenic plants, 
and difficulty in folding, assembly/processing of human proteins in E.coli should make 

1 5 chloroplasts an ideal compartment for expression of these proteins; production of human proteins 
in transgenic chloroplasts should also dramatically lower the production cost. Large-scale 
production of these proteins in plants should be a powerful approach to provide treatment to 
patients at an affordable cost and provide tobacco farmers alternate uses for this hazardous crop. 
Therefore, we propose here expression of therapeutic proteins in transgenic tobacco chloroplasts 

20 to increase levels of expression and accomplish in vivo processing. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 is a graphical representation of total protein versus leaf age in transgenic tobacco 
plants. 

Figure 2 is an electron micrograph showing Cry2Aa2 crystals in a transgenic tobacco leaf. 
25 Figure 3 is a photograph of leaves infected with P. syringae 5 days after inoculation. 
Figure 4 is a graph showing the results of an in vitro assay of P. aeruginosa. 
Figure 5 is two graphs showing oligomeric CTB expression levels as Total Soluble Protein. 
Figure 6 is a Western Blot Analysis of transgenic chloroplast expressed CTB and commercially 
available purified CTB antigen. 
30 Figure 7 is a Western Blot Analysis of heavy and light chains of Guy's 13 monoclonal antibody 
from plant chloroplasts. 
Figure 8 is a Western Blot of transgenic potato tubers, cv Desiree. 
Figure 9 is a frequency histogram including percentage Kennebec and Desiree transgenci 
plants expressing different HAS levels. 
35 Figure 10 is a Western Blot of E. coli protein extracts. 
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HUMAN SERUM ALBUMIN 

HSA is a monomelic globular protein and consists of a single, generally nonglycosylated, 
polypeptide chain of 585 amino acids (66.5 KDa and 17 disulfide bonds) with no 
postradiational modifications. It is composed of three structurally similar globular domains and 
5 the disulfides are positioned in repeated series of nine loop-link-loop structures centered around 
eight sequential Cys-Cys pairs. HSA is initially synthesized as pre-pro-albumin by the liver and 
released from the endoplasrnatic reticulum after removal of the aminoterminal prepeptide of 18 
amino acids. The pro-albumin is further processed in the Golgi complex where the other 6 
armnoterrninal residues of the propeptide are cleaved by a serine proteinase (12). This results in 
1 0 the secretion of the mature polypeptide of 585 amino acids. HSA is encoded by two codominant 
autosomic allelic genes. HSA belongs to the nmltigene family of proteins that include alpha- 
fetoprotein and human group-specific component (Gc) or vitamin D-binding family. HSA 
facilitates transfer of many ligands across organ circulatory interfaces such as in the liver, 
intestine, kidney and brain. In addition to blood plasma, serum alburnin is also found in tissues. 
15 HSA accounts for about 60% of the total protein in blood serum. In the serum of human adults, 
the concentration of albumin is 40 rng/rnl. 

The primary function of HSA is the maintenance of colloid osmotic pressure (COP) 
within the blood vessels. Its abundance makes it an important determinant of the 
pharmacokinetic behavior of many drugs. Reduced synthesis of HSA can be due to advanced 
20 liver disease, impaired intestinal absorption of nutrients or poor nutritional intake. Increased 
albumin losses can be due to kidney diseases (increased glomerular permeability to 
macromolecules in the nephrotic syndrome), intestinal diseases (protein-losing enteropathies) or 
exudative skin disorders (bums). Catabolic states such as chronic infections, sepsis, surgery, 
intestinal resection, trauma or extensive burns can also cause hypoalbuminemia. HSA is used in 
25 therapy of blood volume disorders, for example posthaemorrhagic acute hypovolemia or 
extensive burns, treatment of dehydration states, and also for cirrhotic and hepatic illnesses. It is 
also used as an additive in perfusion liquid for extracorporeal circulation. HSA is used clinically 
for replacing blood volume, but also has a variety of non-therapeutic uses, including its role as a 
stabilizer in formulations for other therapeutic proteins. HSA is a stabilizer for biological 
30 materials in nature and is used for preparing biological standards and reference materials. 
Furthermore, HSA is frequently used as an experimental antigen, a cell-culture constituent and a 
standard in clinical-chemistry tests. 

The expression and purification of recombinant HSA from various microorganisms has 
been reported previously (13-17). Saccharornyces cerevisiae has been used to produce HSA both 
35 intracellulary, requiring denaturation and refolding prior to analysis (18), and by secretion (19). 
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was equivalent structurally, but the recombinant product had lower levels of 
expression (recovery) and structural heterogeneity compared to the blood derived protein (20). 
HSA was also expressed in Kluyveromyces lactis, a yeast with good secretary properties 
achieving 1 g/liter in fed batch cultures (21). Ohtani et al (22) developed a HSA expression 
5 system using Pichia pastoris and established a purification method obtaining recombinant 
protein with similar levels of purity and properties as the human protein. In Bacillus subtilis, 
HSA could be secreted using bacterial signal peptides (15). HSA production in E. coli was 
successful but required additional in vitro processing with trypsin to yield the mature protein 
(14). Sijmons et al. (23) expressed HSA in transgenic potato and tobacco plants. Fusion of HSA 
10 to the plant PR-S presequence resulted in cleavage of the presequence at its natural site and 
secretion of correctly processed HSA, that was indistinguishable from the authentic human 
protein. The expression was 0.014% of the total soluble protein. However, none of these methods 
have been exploited commercially. 

Albumin is currently obtained by protein fractionation from plasma and is the world's 
15 most used intravenous protein, estimated at around 500 metric tons per year. Albumin is 
administered by intravenous injection of solutions containing 20% of albumin. The average 
dosage of albumin for each patient varies between 20-40 grams/day. The consumption of 
alburnin is around 700 kilograms per million habitants per year. In addition to the high cost, HSA 
has the risk of transirutting diseases as with other blood-derivative products. The price of 
20 albumin is about $3.7/g. Thus, the market of this protein approximately amounts to $ 2,600,000 
per million people per year (0.7 billion dollars per year in USA). Because of the high cost of 
albumin, synthetic macromolecules (like dextrans) are used to increase plasma colloidosmotic 
pressure. 

Commercial HSA is mainly prepared from human plasma. This source, hardly meets the 
25 requirements of the world market. The availability of human plasma is limited and careful heat 
treatment of the product prepared must be performed to avoid potential contamination of the 
product by hepatitis, HTV and other viruses. The costs of HSA extraction from blood are very 
high. In order to meet the demands of the large alburnin market with a safe product at a low cost, 
innovative production systems are needed. Plant biotechnology offers promise of obtaining safe 
30 and cheap proteins to be used to treat human diseases. 

INTERFERON ALPHA 

Interferons (TFNs) constitute a heterogeneous family of cytokines with antiviral, 
antigrowth, and immunomodulatory properties (24-26). Type I IFNs are acid-stable and 
35 constitute the first line of defence against viruses, both by displaying direct antiviral effects and 
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with the cytokine cascade and the immune system. Their function is to induce 
regulation of growth and differentiation of T cells. The human IFN-a family consists of at least 
22 intronless genes, 9 of which are pseudogenes and 13 expressed genes (subtypes) (27). Human 
IFN-a genes encode proteins of 188 or 189 amino acids. The first 23 amino acids constitute a 
5 signal peptide, and the other 165 or 166 amino acids form the mature protein, IFN-oc subtypes 
show 78-94 % homology at the nucleotide level. Presence of two disulfide bonds between Cys- 
l:Cys-99 and Cys-29:Cysl39 is conserved among all IFN-a species (28). Human IFN-a genes 
are expressed constitutively in organs of normal individuals (29,30). Individual IFN-a genes are 
differently expressed depending on the stimulus and they show restricted cell type expression 

10 (31). Although all IFN-a subtypes bind to a common receptor (32), several reports suggest that 
they show quantitatively distinct patterns of antiviral, growth inhibitory and irnmunornodulatory 
activities (33). IFN-a8 and IFN-a5 seem to have the greatest antiviral activity in liver tumour 
ceils HuH7 (33). EFN-a5 has, at least, the same antiviral activity as IFN-a2 in in vitro 
experiments (unpublished data in Dr. Prieto's lab). It has been shown recently that IFN-cx5 is the 

15 sole IFN-a subtype expressed in normal liver tissue (34). IFN-a5 expression in patients with 
chronic hepatitis C is reduced in the liver (34) and induced in mononuclear cells (35). 

Interferons are mainly known for their antiviral activities against a wide spectrum of 
viruses but also for their protective role against some non-viral pathogens. They are potent 
immunomodulators, possess direct antiproliferative activities and are cytotoxic or cytostatic for a 

20 number of different tumour cell types. IFN-a is mainly employed as a standard therapy for hairy 
cell leukaemia, metastasizing carcinoma and AIDS-associated angiogenic tumours of mixed 
cellularity known as kaposi sarcomas. It is also active against a number of other tumours and 
viral infections. For example, it is the current approved therapy for chronic viral hepatitis B 
(CHB) and C (CHC). The IFN-a subtype used for chronic viral hepatitis is IFN-a2. About 40% 

25 of patients with CHB and about 25% of patients with CHC respond to this therapy with sustained 
viral clearance. The usual doses of IFN-a are 5-10 MU (subcutaneous injection) three days per 
week for 4-6 months for CHB and 3 MU three days per week for 12 months for CHC. Three MU 
of IFNa2 represent approximately 15 Dg of recombinant protein- The response rate in patients 
with chronic hepatitis C can be increased by combining IFN-cc2 and ribavirin. This combination 

30 therapy, which considerably increases the cost of the therapy and causes some additional side 
effects, results in sustained biochemical and virological remission in about 40-50% of cases. 
Recent data suggest that pegilated interferon in weekly doses of 180 Dg can also increase the 
sustained response rate to about 40%. IFN~a5 is the only IFN-a subtype expressed in liver; this 
expression is reduced in patients with CHC and IFN-a5 seems to have one of the highest 
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ty in liver tumour cells (see above). An international patent to use IFN-a5 has 
been filed by Prieto's group to facilitate commercial development (36). 

Human interferons are currently prepared in microbial systems via recombinant DNA 
technology in amounts which cannot be isolated from natural sources (leukocytes, fibroblasts, 
5 lymphocytes). Different recombinant interferon-D genes have been cloned and expressed in E. 
coli (37a,b) or yeast (38) by several groups. Generally, the synthesized protein is not correctly 
folded due to the lack of disulfide bridges and therefore, it remains insoluble in inclusion bodies 
that need to be solubilized and refolded to obtain the active interferon (39,40). One of the most 
efficient methods of interferon-D expression has been published recently by Babu et aL (41). In 
1 0 this method, E. coli cells transformed with interferon vectors (regulated by temperature inducible 
promoters) were grown in high cell density cultures; this resulted in the production of 4 g 
interferon- Q/liter of culture. Expression resulted exclusively in the form of insoluble inclusion 
bodies which were solubilized under denaturing conditions, refolded and purified to near 
homogeneity. The yield of purified interferon-D was approximately 300mg/I of culture. 
15 Expression in plants via the nuclear genome has not been very successful. Smimov et al. (42) 
obtained transformed tobacco plants with Agrobacterium tumefaciens using the interferon-D 
gene under 35S CaMV promoter but the expression level was very low. Eldelbaum et al. (43) 
showed tobacco nuclear transformation with Interferon-D and the expression level detected was 
0.000017% of fresh weight. 
20 The number of subjects infected with hepatitis C virus (HCV) is estimated to be 120 

million (5 million in Europe and 4 million in USA). Seventy per cent of the infected people have 
abnormal liver function and about one third of these have severe viral hepatitis or cirrhosis. It 
might be estimated however that there are about 10,000-15,000 cases of chronic infection with 
hepatitis B virus (HBV) in Europe, a slightly lower number of cases in USA. In Asia the 
25 prevalence of chronic HCV and HBV infection is very high (about 110 million of people are 
infected by HCV and about 150 millions are infected by HBV). In Africa HCV infection is very 
prevalent. Since unremitting chronic viral hepatitis leads to liver cirrhosis and eventually to liver 
cancer, the high prevalence of HBV and HCV infection in Asia and Africa accounts for their 
very high incidence of hepatocellular carcinoma. Based on these data, the need for IFN-ct is 
30 large. IFN-a2 is currently produced in rnicroorganisms by a number of companies and the price 
of 3 MU (15 Og) of recombinant protein in the western market is about $25. Thus, the cost of 
one year IFN-a2 therapy is about $ 4,000 per patient. This price makes this product unavailable 
for most of the patients in the world suffering from chronic viral hepatitis. Clearly methods to 
produce less expensive recombinant proteins via plant biotechnology innovations would be 
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z antiviral therapy widely available. Besides, if IFN-a5 is more efficient than IFN- 
ot2, lower doses may be required. 

INSULIN-LIKE GROWTH FACTOR-I (IGF-I) 

5 The Insulin-like Growth Factor protein, IGF-I, is an anabolic hormone with a complex 

maturation process. A single IGF-I gene is transcribed into several mRNAs by alternative 
splicing and use of different transcription initiation sites (44-46). Depending on the choice of 
splicing, two immature proteins are produced: IGF-IA, expressed in several tissues and IGF-IB, 
mostly expressed in liver (45). Both pre-proteins produce the same mature protein. A and B 

10 immature forms have different lengths and composition, as their termini are modified post- 
translationally by glycosylation. However, these ends are processed in the last step of maturation. 
Mature IGF-I protein is secreted, not glycosylated and has three disulfide bonds, 70 amino acids 
and a molecular weight of 7.6 kD (47-49). Physiologically, IGF-I expression is induced by 
growth hormone (GH). Actually, the knock out of IGF-I in mice has shown that several functions 

15 attributed originally to GH are in fact mediated by IGF-I. GH production by adenohypofisis is 
repressed by feed-back inhibition of IGF-I. GH induces IGF-I synthesis in different tissues, but 
mostly in liver, where 90% of IGF-I is produced (48). The IGF-I receptor is expressed in 
different tissues. It is formed by two polypeptides: alpha that interacts with IGF-I and beta 
involved in signal transduction and also present in the insulin receptor (50,51). Thus, IGF-I and 

20 insulin activation are similar. 

IGF-I is a potent multifunctional anabolic hormone produced in the liver upon 
stimulation by growth hormone (GH). In liver cirrhosis the reduction of receptors for GH in 
hepatocytes and the diminished synthesis of the liver parenchyma cause a progressive fall of 
serum IGF-I levels. Patients with liver cirrhosis have a number of systemic Arrangements such 

25 as muscle atrophy, osteopenia, hypogonadism, protein-calorie malnutrition which could be 
related to reduced levels of circulating IGF-I. Recent studies from Prieto's laboratory have 
demonstrated that treatments with low doses of IGF-I induce significant improvements in 
nutritional status (52), intestinal absorption (53-55), osteopenia (56), hypogonadism (57) and 
liver function (58) in rats with experimental liver cirrhosis. These data support that IGF-I 

30 deficiency plays a pathogenic role in several systemic complications occurring in liver cirrhosis. 
The liver can be considered as an endocrine gland synthesising a hormone such as IGF-I with 
important physiological functions. Thus liver cirrhosis should be viewed as a disease 
accompanied by a hormone deficiency syndrome for which replacement therapy with IGF-I is 
warranted Clinical studies are in progress to ascertain the role of IGF-I in the management of 
35 cirrhotic patients. IGF-I is also being currently used for Laron dwarfism treatment These 
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liver GH receptor so IGF-I is not expressed (59). Also IGF-I, acting as a 
hypoglycemiant, is given together with insulin in diabetes mellitus (60,61). Anabolic effects of 
IGF-I are used in osteoporosis treatment (62,63) hypercatabolism and starvation due to burning 
and HTV infection (64,65). Unpublished studies indicate that IGF-I could also be used in patients 
5 with articular degenerative disease (osteoarthritis). 

The potency of IGF-I has encouraged a great number of scientists to try IGF-I expression 
in various microorganisms due to the small amount present in human plasma. Production of IGF- 
I in yeast was shown to have several disadvantages like low fermentation yields and risks of 
obtaiiung undesirable glycosylation in these molecules (66). Expression in bacteria has been the 

10 most successful approach, either as a secreted form fused to protein leader sequences (67) or 
fused to a solubHized affinity fusion protein (68). In addition, IGF-I has been produced as 
insoluble inclusion bodies fused to protective polypeptides (69). Sun-Ok Kim and Young Lee 
(70a) expressed IGF-I as a truncated beta-galactosidase fusion protein. The final purification 
yielded approximately 5 mg of IGF-I having native conformation per liter of bacterial culture. 

1 5 IGF-I has also been expressed in animals. Zinovieva et al. (70b) reported an expression of 0.543 
mg/ml in rabbit milk. 

IGF-I circulates in plasma in a fairly high concentration varying between 120-400 ng/mL 
In cirrhotic patients the values of IGF-I fall to 20 ng/ml and frequently to undetectable levels. 
Replacement therapy with IGF-I in liver cirrhosis requires administration of 1.5-2 mg per day for 

20 each patient Thus, every cirrhotic patient will consume about 600 mg per year. IGF-I is 
currently produced in bacteria (71). The high amount of recombinant protein needed for IGF-I 
replacement therapy in patients with liver cirrhosis will make this treatment exceedingly 
expensive if new methods for cheap production of recombinant proteins are not developed. 
Besides, as described above, IGF-I is used in treatment of dwarfism, diabetes, osteoporosis, 

25 starvation and hypercatabolism. IGF-I use in osteoarthritis is currently being investigated. 
Again, plant biotechnology could provide a solution to make economically feasible the 
application of IGF-I therapy to all these patients. 



30 SUMMARY OF THE INVENTION 

The present invention develops recombinant DNA vectors for enhanced expression of 
human serum albumin, insulin-like growth factor I, and interferon-0 2 and 5, via 
chloroplast genomes of tobacco, 
optimizes processing and purification of pharmaceutical proteins using chloroplast vectors in E. 
35 colU and 
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w ;nic tobacco plants. 

The transgenic expression of proteins or fusion proteins is characterized using molecular and 

biochemical methods in chloroplasts. 
Existing or modified methods of purification are employed on transgenic leaves. 
5 Mendelian or maternal inheritance of transgenic plants is analyzed. 

Large scale purification of therapeutic proteins from transgenic tobacco and comparison of 

current purification methods in E.coli or yeast is performed, and 
natural refolding in chloroplasts is compared with existing in vitro processing methods; 
Comparison/characterization (yield and purity) of therapeutic proteins produced in yeast or 
1 0 E.coli with transgenic tobacco chloroplasts is performed, as are 

In vitro and in vivo (pre-clinical trials) studies of protein biofunctionality. 

DETAILED DESCRIPTION OF THE INVENTION 

When the concept of chloroplast genetic engineering was developed (72,73), it was 

1 5 possible to introduce isolated intact chloroplasts into protoplasts and regenerate transgenic plants 
(74). Therefore, early investigations on chloroplast transformation focused on the development 
of in organello systems using intact chloroplasts capable of efficient and prolonged transcription 
and translation (75-77) and expression of foreign genes in isolated chloroplasts (78). However, 
after the discovery of the gene gun as a transformation device (79), it was possible to transform 

20 plant chloroplasts without the use of isolated plastids and protoplasts. Chloroplast genetic 
engineering was accomplished in several phases. Transient expression of foreign genes in 
plastids of dicots (80,81) was followed by such studies in monocots (82). Unique to the 
chloroplast genetic engineering is the development of a foreign gene expression system using 
autonomously replicating chloroplast expression vectors (SO). Stable integration of a selectable 

25 marker gene into the tobacco chloroplast genome (83) was also accomplished using the gene 
gun. However, useful genes conferring valuable traits via chloroplast genetic engineering have 
been demonstrated only recently. For example, plants resistant to B.t. sensitive insects were 
obtained by integrating the crylAc gene into the tobacco chloroplast genome (84). Plants resistant 
to B.t. resistant insects (up to 40,000 fold) were obtained by hyper-expression of the cry 2 A gene 

30 within the tobacco chloroplast genome (85). Plants have also been genetically engineered via the 
chloroplast genome to confer herbicide resistance and the introduced foreign genes were 
maternally inherited, overcoming the problem of out-cross with weeds (86). Chloroplast genetic 
engineering technology is currently being applied to other useful crops (73,87). 

A remarkable feature of chloroplast genetic engineering is the observation of 

35 exceptionally large accumulation of foreign proteins in transgenic plants, as much as 46% of 
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n total soluble protein, even in bleached old leaves (3). Stable expression of a 
pharmaceutical protein in chloropiasts was first reported for GVGVP, a protein based polymer 
with varied medical applications (such as the prevention of post-surgical adhesions and scars, 
wound coverings, artificial pericardia, tissue reconstruction and programmed drug delivery (88)). 

5 Subsequently, expression of the human somatotropin via the tobacco chloroplast genome (9) to 
high levels (7% of total soluble protein) was observed. The following investigations that are in 
progress in the Daniell laboratory illustrate the power of this technology to express small 
peptides, entire operons, vaccines that require oligomeric proteins with stable disulfide bridges 
and monoclonals that require assembly of heavy/light chains via chaperonins. 

0 In plant and animal cells, nuclear mRNAs are translated monocistronically. This poses a 

serious problem when engineering multiple genes in plants (91). Therefore, in order to express 
the polyhydroxybutyrate polymer or Guy's 13 antibody, single genes were first introduced into 
individual transgenic plants, then these plants were back-crossed to reconstitute the entire 
pathway or the complete protein (92,93). Similarly, in a seven year long effort, Ye et al. (81) 

5 recently introduced a set of three genes for a short biosynthetic pathway that resulted in p- 
carotene expression in rice. In contrast, most chloroplast genes of higher plants are cotranscribed 
(91). Expression of polycistrons via the chloroplast genome provides a unique opportunity to 
express entire pathways in a single transformation event. The Bacillus thuringiensis (Bt) 
cry2Aa2 operon has recently been used as a model system to demonstrate operon expression and 

0 crystal formation via the chloroplast genome (3). CrylAsQ. is the distal gene of a three-gene 
operon. The ^immediately upstream of ery2Aa2 codes for a putative chaperonin that facilitates 
the folding of cry2Aa2 (and other proteins) to form proteolyticalry stable cuboidal crystals (94). 

Therefore, the cry2Aa2 bacterial operon was expressed in tobacco chloropiasts to test the 
resultant transgenic plants for increased expression and improved persistence of the accumulated 

5 insecticidal protein(s). Stable foreign gene integration was confirmed by PCR and Southern blot 
analysis in To and Tj transgenic plants. Cry2Aa2 operon derived protein accumulated at 45.3% of 
the total soluble protein in mature leaves and remained stable even in old bleached leaves 
(46.l%)(Figure 1). This is the highest level of foreign gene expression ever reported in 
transgenic plants. Exceedingly difficult to control insects (10-day old cotton bollworm, 

0 beetarmy worm) were killed 100% after consuming transgenic leaves. Electron micrographs 
showed the presence of the insecticidal protein folded into cuboidal crystals similar in shape to 
Cry2Aa2 crystals observed in Bacillus thuringiensis (Figure 2). In contrast to currently marketed 
transgenic plants with soluble CRY proteins, folded protoxin crystals will be processed only by 
target insects that have alkaline gut pH; this approach should improve safety of Bt transgenic 

5 plants. Absence of insecticidal proteins in transgenic pollen ebrninates toxicity to non-target 
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Hen. In addition to these environmentally friendly approaches, this observation 
should serve as a model system for large-scale production of foreign proteins within chloroplasts 
in a folded configuration enhancing their stability and facilitating single step purification. This is 
the first demonstration of expression of a bacterial operon in transgenic plants and opens the 
5 door to engineer novel pathways in plants in a single transformation event. 

It is common knowledge that the medical community has been fighting a vigorous battle 
against drug resistant pathogenic bacteria for years. Cationic antibacterial peptides from 
mammals, amphibians and insects have gained more attention over the last decade (95). Key 
features of these cationic peptides are a net positive charge, an affinity for negatively-charged 

1 0 prokaryotic membrane phospholipids over neutral-charged eukaryotic membranes and the ability 
to form aggregates that disrupt the bacterial membrane (96). 

There are three major peptides with a-helical structures, cecropin from Hyalophora 
cecropia (giant silk moth), magainins from Xenopus laevis (African frog) and defensins from 
mammalian neutrophils. Magainin and its analogues have been studied as a broad-spectrum 

15 topical agent, a systemic antibiotic; a wound-healing stimulant; and an anticancer agent (97). We 
have recently observed that a synthetic lytic peptide (MSI-99, 22 amino acids) can be 
successfully expressed in tobacco chloroplast (98). The peptide retained its lytic activity against 
the phytopathogenic bacteria Pseudomonas syringae and multidrug resistant human pathogen, 
Pseudomonas aeruginosa. The anti-microbial peptide (AMP) used in this study was an 

20 amphipathic alpha-helix molecule that has an affinity for negatively charged phospholipids 
commonly found in the outer-membrane of bacteria. Upon contact with these membranes, 
individual peptides aggregate to form pores in the membrane, resulting in bacterial lysis. 
Because of the concentration dependent action of the AMP, it was expressed via the chloroplast 
genome to accomplish high dose delivery at the point of infection. PCR products and Southern 

25 blots confirmed chloroplast integration of the foreign genes and homoplasmy. Growth and 
development of the transgenic plants was unaffected by hyper-expression of the AMP within 
chloroplasts. In vitro assays with To and Ti plants confirmed that the AMP was expressed at 
high levels (21.5 to 43% of the total soluble protein) and retained biological activity against 
Pseudomonas syringae, a major plant pathogen. In situ assays resulted in intense areas of 

30 necrosis around the point of infection in control leaves, while transformed leaves showed no 
signs of necrosis (200-800 jxg of AMP at the site of mfection)(Figure 3). Ti in vitro assays 
against Pseudomonas aeruginosa (a multi-drug resistant human pathogen) displayed a 96% 
inhibition of growth (Figure 4). These results give a new option in the battle against 
phytopathogenic and drug-resistant human pathogenic bacteria. Small peptides (like insulin) are 
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tost organisms. However, stability of this AMP in chloroplasts opens up this 
compartment for expression of hormones and other small peptides. 

Expression of cholera toxin (3 subunit oligomers as a vaccine in chloroplasts 

5 Vibrio cholerae, which causes acute watery diarrhea by colonizing the small intestine and 

producing the enterotoxin, cholera toxin (CT). Cholera toxin is a hexameric AB5 protein 
consisting of one toxic 27kDa A subunit having ADP ribosyi transferase activity and a nontoxic 
pentarner of 11.6 kDa B subunits (CTB) that binds to the A subunit and facilitates its entry into 
the intestinal epithelial cells. CTB when administered orally (99) is a potent mucosal immunogen 

10 which can neutralize the toxicity of the CT holotoxin by preventing it from binding to the 
intestinal cells (100). This is believed to be a result of it binding to eukaryotic cell surfaces via 
the GmI gangliosides, receptors present on the intestinal epithelial surface, thus eliciting a 
mucosal immune response to pathogens (101) and enhancing the immune response when 
chemically coupled to other antigens (102-105). 

15 Cholera toxin (CTB) has previously been expressed in nuclear transgenic plants at levels 

of 0.01 (leaves) to 0.3% (tubers) of the total soluble protein. To increase expression levels, we 
engineered the chloroplast genome to express the CTB gene (10). We observed expression of 
oligomeric CTB at levels of 4-5% of total soluble plant protein (Figure 5 A). PCR and Southern 
Blot analyses confirmed stable integration of the CTB gene into the chloroplast genome. Western 

20 blot analysis showed that transgenic chloroplast expressed CTB was antigenically identical to 
commercially available purified CTB antigen (Figure 6). Also, Gj^i-ganglioside binding assays 
confirm that chloroplast synthesized CTB binds to the intestinal membrane receptor of cholera 
toxin (Figure 5B). Transgenic tobacco plants were morphologically indistinguishable from 
untransformed plants and the introduced gene was found to be stably inherited in the subsequent 

25 generation as confirmed by PCR and Southern Blot analyses. The increased production of an 
efficient transmucosal carrier molecule and delivery system, like CTB, in chloroplasts of plants 
makes plant based oral vaccines and fusion proteins with CTB needing oral administration^ a 
much more feasible approach. This also establishes unequivocally that chloroplasts are capable 
of forming disulfide bridges to assemble foreign proteins. 

30 

Expression and assembly of monoelonals in transgenic chloroplasts 

Dental caries (cavities) is probably the most prevalent disease of humankind- 
Colonization of teeth by S. mutans is the single most important risk factor in the development of 
dental caries. S. mutans is a non-motile, gram positive coccus. It colonizes tooth surfaces and 
35 synthesizes glucans (insoluble polysacchr- ; ' lrtX — 1 fhictans from sucrose using the enzymes 
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rase and fiuctosyltransferase respectively (106a). The glucans play an important 
role by allowing the bacterium to adhere to the smooth tooth surfaces. After its adherence, the 
bacterium ferments sucrose and produces lactic acid. Lactic acid dissolves the minerals of the 
tooth, producing a cavity. 

5 A topical monoclonal antibody therapy to prevent adherence of S. mutatis to teeth has 

recently been developed. The incidence of cariogenic bacteria (in humans and animals) and 
dental caries (in animals) was dramatically reduced for periods of up to two years after the 
cessation of the antibody therapy. No adverse events were detected either in the exposed animals 
or in human volunteers (106b). The annual requirement for this antibody in the US alone may 

10 eventually exceed 1 metric ton. Therefore, this antibody was expressed via the chloropiast 
genome to achieve higher levels of expression and proper folding (11). The integration of 
antibody genes into the chloropiast genome was confirmed by PCR and Southern blot analysis. 
The expression of both heavy and light chains was confirmed by western blot analysis under 
reducing conditions (Figure 7A,B). The expression of fully assembled antibody was confirmed 

15 by western blot analysis under non-reducing conditions (Figure 7C). This is the first report of 
successful assembly of a multi-subunit human protein in transgenic chloroplasts. Production of 
monoclonal antibodies at agricultural level should reduce their cost and create new applications 
of monoclonal antibodies. 

HUMAN SERUM ALBUMIN 

20 Nuclear transformation 

The human HSA cDNA was cloned from human liver cells and the patatin promoter 
(whose expression is tuber specific (107)) fused along with the leader sequence of PIN H 
(proteinase II inhibitor potato transit peptide that directs HSA to the apoplast (108)). Leaf discs 
of Desiree and Kennebec potato plants were transformed using Agrobacterium tumefaciens. A 

25 total of 98 transgenic Desiree clones and 30 Kennebec clones were tested by PCR and western 
blots. Western blots showed that the recombinant albumin (rHSA) had been properly cleaved by 
the proteinase II inhibitor transit peptide (Figure 8). Expression levels of both cultivars were very 
different among all transgenic clones as expected (Figure 9), probably because of position effects 
and gene silencing (89,90). The population distribution was similar in both cultivars: majority of 

30 transgenic clones showed expression levels between 0.04 and 0.06% of rHSA in the total soluble 
protein. The maximum recombinant HSA amount expressed was 0.2%. Between one and five T- 
DNA insertions per tetraploid genome were observed in these clones. Plants with higher protein 
expression were always clones with several copies of the HSA gene. Levels of rnRNA were 
analyzed by Northern blots. There was a correlation between transcript levels and recombinant 

35 albumin accumulation in transgenic tubers. The N-terrninal sequence showed proper cleavage of 
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^ dde and the amino tenninal sequence between recombinant and human HSA was 
identical. Inhibition of patatin expression using the antisense technology did not improve the 
amount of rHSA, Average expression level among 29 transgenic plants was 0.032% of total 
soluble protein, with a maximum expression of 0.1%. 
5 Transformation of the tobacco chloroplast genome was initiated for hyperexpression of 

HSA. The codon composition is ideal for chloroplast expression and no changes in nucleotide 
sequences were necessary. For all the constructs pLD vector was used. Several vectors were 
designed to optimize HSA expression. All these contained ATG as the first amino acid of the 
mature protein. 

10 RBS-ATG-HSA 

The first vector included the gene that codes for the mature HSA plus an additional ATG 
as a translation initiation codon. We included the ATG in one of the primers of the PCR, 5 
nucleotides downstream of the chloroplast preferred RBS sequence GGAGG, The cDNA 
sequence of the mature HSA (cloned in Dr. Mingo-CastePs laboratory) was used as a template. 

15 The PCR product was cloned into PCR 2.1 vector, excised as an EcoRl-NotI fragment and 
introduced into the pLD vector. 
5'UTRpsbA-ATG-HSA 

The 200 bp tobacco chloroplast DNA fragment containing the 5' psbA UTR was 
amplified using PCR and tobacco DNA as template. The fragment was cloned into PCR 2.1 

20 vector, excised EcoRI-Ncol fragment was inserted at the Ncol site of the ATG-HSA and finally 
inserted into the pLD vector as an EcoRI-NotI fragment downstream of the 16S rRNA promoter 
, to enhance translation of the protein. 
BtORJFl+2-ATG-HSA 

ORF1 and ORF2 of the Bt Cry2Aa2 operon were amplified in a PCR using the complete 

25 operon as a template. The fragment was cloned into PCR 2.1 vector, excised as an EcoRI-EcoRV 
fragment, inserted at EcoRV site with the ATG-HSA sequence and introduced into the pLD 
vector as an EcoRI-Notl fragment. The ORF1 and ORF2 were fused upstream of the ATG-HSA. 

Because of the similarity of protein synthetic machinery (109), expression of all 
chloroplast vectors was first tested in E.coli before their use in tobacco transformation. Different 

30 levels of expression were obtained in E. coli depending on the construct (Figure 10). Using the 
psbA 5' UTR and the ORF1 and ORF2 of the ay2Aa2 operon, we obtained higher levels of 
expression than using only the RBS. We have observed in previous experiments that HSA in E. 
coli is completely insoluble (as is shown in ref 14), probably due to an improper folding resulting 
from the absence of disulfide bonds. This is the reason why the protein is precipitated in the gel 

35 (Figure 10). Different polypeptide sizes were observed, probably due to incomplete translation. 
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E. coli and chloroplast have similar protein synthesis machinery, one could expect 
different levels of expression in transgenic tobacco chloroplasts depending on the regulatory 
sequences, with the advantage that disulfide bonds are formed in chloroplasts (9). These three 
vectors were bombarded into tobacco leaves via particle bombardment (110) and after 4 weeks 
5 small shoots appeared as a result of independent transformation events. 

INTERFERON- □ 5 

Interferon- D5 has not been expressed yet as a commercial recombinant protein. The first 
attempt has been made recently. The IFN-05 gene was cloned and the sequence of the mature 

10 protein was inserted into the pET28 vector, that included the ATG, histidine tag for purification 
and thrombin cleavage sequences. The tagged IFN-D5 was purified first by binding to a nickel 
column and biotinylated thrombin was then used to eliminate the tag on IFN-D □. Biotinylated 
thrombin was removed from the preparation using streptavidin agarose. The expression level was 
5.6 micrograms per liter of broth culture and the recombinant protein was active in antiviral 

1 5 activity similar or higher than commercial IFN-D 2 (Intron A, Schering Plouth). 

Tnsulin-like Growth Factor-I (IGF-H 

Recent studies have demonstrated that treatment with low doses of IGF-I induced 
significant improvements in nutritional status (52), intestinal absorption (53-55), osteopenia (56), 
20 hypogonadism (57) and liver function (58) in rats with experimental liver cirrhosis. These data 
support that IGF-I deficiency plays a pathogenic role in several systemic complications occurring 
in liver cirrhosis. Clinical studies are in progress to ascertain the role of IGF-I in the 
management of cirrhotic patients. Unpublished studies indicate that IGF-I could also be used in 
patients with articular degenerative disease (osteoarthritis). 

25 

Experimental 
Example 1 

Evaluation of chloroplast gene expression 

A systematic approach is used to identify and overcome potentid limitations of foreign 

30 gene expression in chloroplasts of transgenic plants. This experiment increases the utility of 
chloroplast transformation system by scientists interested in expressing other foreign proteins. 
Therefore, it is important to systematically analyze transcription, RNA abundance, RNA 
stability, rate of protein synthesis and degradation, proper folding and biological activity. The 
rate of transcription of the introduced HSA gene is compared with the highly expressing 

35 endogenous chloroplast genes (rbcL, psbA, 16S rRNA), using run on transcription assays to 
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le I6S1&NA promoter is operating as expected. The transcription efficiency of 
transgenic chloroplast containing each of the three constructs with different 5' regions is tested. 
Similarly, transgene RNA levels are monitored by northerns, dot blots and primer extension 
relative to endogenous rbcL, 16S rRNA or psbA. These results, along with run on transcription 
5 assays, provide valuable information of RNA stability, processing, etc. RNA appears to be 
extremely stable based on northern blot analysis. This systematic study is valuable to advance 
utility of this system by other scientists. Most importantly, the efficiency of translation is tested 
in isolated chloroplasts and compared with the highly translated chloroplast protein (psbA). 
Pulse chase experiments help assess if translations! pausing, premature termination occurs. 

1 0 Evaluation of percent RNA loaded on polysomes or in constructs with or without 5*UTRs helps 
to determine the efficiency of the ribosorne binding site and 5 f stem-loop translational enhancers. 
Codon optimized genes (IGF-I, IFN) are compared with unmodified genes to investigate the rate 
of translation, pausing and termination. A 200-fold difference in accumulation of foreign 
proteins due to decreases in proteolysis conferred by a putative chaperonin (3) was observed. 

1 5 Therefore, proteins from constructs expressing or not expressing the putative chaperoriin (with or 
without ORF1+2) provide valuable information on protein stability. 

Example 2 

Expression of the mature protein 

20 HSA ? Interferon and IGF-I are pre-proteins that need to be cleaved to secrete mature 

proteins. The codon for translation initiation is in the presequence. In chloroplasts, the necessity 
of expressing the mature protein forces introduction of this additional amino acid in coding 
sequences. In order to optimize expression levels, we first subclone the sequence of the mature 
proteins beginning with an ATG. Subsequent immunological assays in mice demonstrates the 

25 exlra-methionine causes immunogenic response arid low bioactivity. Alternatively, systems may 
also produce the mature protein. These systems can include the synthesis of a protein fused to a 
peptide that is cleaved intracellulary (processed) by chloroplast enzymes or the use of chemical 
or enzymatic cleavage after partial purification of proteins from plant cells. 

30 Use of peptides that are cleaved in chloroplast 

Staub et al. (9) reported chloroplast expression of human somatotropin similar to the 
native human protein by using ubiquitin fusions that were cleaved in the stroma by an ubiquitin 
protease. However, the processing efficiency ranged from 30-80% and the cleavage site was not 
accurate. In order to process chloroplast expressed proteins a peptide which is cleaved in the 
35 stroma is essential. The transit peptide sequence of the RuBisCo (ribulose 1,5-bisphosphate 
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mall subunit is an ideal choice. This transit peptide has been studied in depth 
(111). RuBisCo is one of the proteins that is synthesized in cytoplasm and transported 
postranslationally into the chloroplast in an energy dependent process. The transit peptide is 
proteolytically removed upon transport in the stroma by the stromal processing peptidase (112). 
5 There are several sequences described for different species (113). A transit peptide consensus 
sequence for the RuBisCo small subunit of vascular plants is published by Keegstra et al. (1 14). 
The amino acids that are proximal to the C-terrninal (41-59) are highly conserved in the higher 
plant transit sequences and belong to the domain which is involved in enzymatic cleavage (1 1 1). 
The RuBisCo small subunit transit peptide has been fused with various marker proteins 

10 (114,115), even with animal proteins (116,117), to target proteins to the chloroplast. Prior to 
transformation studies, the cleavage efficiency and accuracy are tested by in vitro translation of 
the fusion proteins and in organello import studies using intact chloroplasts. Thereafter, 
knowing the correct fusion sequence for producing the mature protein, such sequence encoding 
the amino tenninal portion of tobacco chloroplast transit peptide is linked with the mature 

15 sequence of each protein. Codon composition of the tobacco RuBisCo small subunit transit 
peptide is compatible with chloroplast optimal translation (see section d3 and table 1 on page 
30). Additional transit peptide sequences for targeting and cleavage in the chloroplast have been 
described (111). The lumen of thylakoids could also be a good target because thylakoids are 
readily purified. Lumenal proteins can be freed either by sonication or with a very low triton 

20 X100 concentration, although this requires insertion of additional amino acid sequences for 
efficient import (1 1 1). 

Example 3 

Use of chemical or enzymatic cleavage 

25 The strategy of fusing a protein to a tag with affinity for a certain ligand has been used 

extensively for more than a decade to enable affinity purification of recombinant products (1 18- 
120). A vast number of cleavage methods, both chemical and enzymatic, have been investigated 
for this purpose (120). Chemical cleavage methods have low specificity and the relatively harsh 
cleavage conditions can result in chemical modifications of the released products (120). Some of 

30 the enzymatic methods offer significantly higher cleavage specificities together with high 
efficiency, e. g. H64A subtilisin, IgA protease and factor Xa (119,120), but these enzymes have 
the drawback of being quite expensive. 

Trypsin, which cleaves C-terminal of basic amino-acid residues, has been used for a long 
time to cleave fusion proteins (14,121). Despite expected low specificity, trypsin has been shown 

35 to be useful for specific cleavage of fusion proteins, leaving basic residues within folded protein 
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avaged (121). The use of trypsin only requires that the N-terminus of the mature 
protein be accessible to the protease and that the potential internal sites are protected in the 
native conformation. Trypsin has the additional advantage of being inexpensive and readily 
available. In the case of HS A, when it was expressed in E. coli with 6 additional codons coding 
5 for a trypsin cleavage site, HSA was processed successfully into the mature protein after 
treatment with the protease. In addition, the N-terminal sequence was found to be unique and 
identical to the sequence of natural HSA, the conversion was complete and no degradation 
products were observed (14). This in vitro maturation is selective because correctly folded 
albumin is highly resistant to trypsin cleavage at inner sites (14). This system could be tested for 
1 0 chloroplasts HSA vectors using protein expressed in E. coli. 

Staub et al. (9) demonstrated that the chloroplast methionine arninopeptidase is active and 
they found 95% of removal of the first methionine of an ATG-somatotropin protein that was 
expressed via the chloroplast genome. There are several investigations that have shown a very 
strict pattern of cleavage by this peptidase (122). Methionine is only removed when second 
1 5 residues are glycine, alanine, serine, cysteine, threonine, proline or valine, but if the third amino 
acid is proline the cleavage is inhibited. In the expression of our three proteins we use this 
approach to obtain the mature protein in the case of Interferon because the penultimate 
aminoacid is cystein followed by aspartic acid. For HSA the second aminoacid is aspartic acid 
and for IGF-I glycine but it is followed by proline, so the cleavage is not dependable. 
20 For IGF-I expression, the use of the TEV protease (Gibco cat n 10127-017) would be 

ideal. The cleavage site that is recognized for this protease is Glu-Asn-Leu-Tyr-Phe-Gln-Gly and 
it cuts between Gln-Gly. This strategy allows the release of the mature protein by incubation 
with TEV protease leaving a glycine as the first amino acid consistent with human mature IGF-I 
protein. 

25 The purification system of the E. coli Interferon-05 expression method was based on 6 

Histidine-tags that bind to a nickel column and biotinylated thrombin to eliminate the tag on 
IFN-D5. Thrombin recognizes Leu-Val-Pro-Arg-Gly-Ser and cuts between Arg and Gly. This 
leaves two extra amino acids in the mature protein, but antiviral activity studies have shown that 
this protein is at least as active as commercial IFN-D2. 

30 

Example 4 

Optimization of gene expression 

Foreign genes are expressed between 3% (cry2Aa2) and 47% (cry2Aa2 operon) in 
transgenic chloroplasts (3,85). Based on the outcome of the evaluation of HSA chloroplast 
35 transgenic plants, several approaches can be used to enhance translation of the recombinant 
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loroplasts, transcriptional regulation of gene expression is less important, although 
some modulations by light and developmental conditions are observed (123). RNA stability 
appears to be one among the least problems because of observation of excessive accumulation of 
foreign transcripts, at times 16,966-fold higher than the highly expressing nuclear transgenic 
5 plants (124). Chloroplast gene expression is regulated to a large extent at the post-transcriptional 
level. For example, 5* UTRs are necessary for optimal translation of chloroplast mRNAs. Shine- 
Dalgarno (GGAGG) sequences, as well as a stem-loop structure located 5' adjacent to the SD 
sequence, are required for efficient translation. A recent study has shown that insertion of the 
psbA 5' UTR downstream of the 16S rRNA promoter enhanced translation of a foreign gene 

10 (GUS) hundred-fold (125a). Therefore, the 200-bp tobacco chloroplast DNA fragment (1680- 
1480) containing 5' psbA UTR should be used. This PCR product is inserted downstream of the 
16S rRNA promoter to enhance translation of the recombinant proteins. 

Yet another approach for enhancement of translation is to optimize codon compositions. 
Since ail the three proteins are translated in E. coli (see section b), it would be reasonable to 

15 expect efficient expression in chloroplasts. However, optimizing codon compositions to match 
the psbA gene could further enhance the level of translation. Although rbcL (RuBisCO) is the 
most abundant protein on earth, it is not translated as highly as the psbA gene due to the 
extremely high turnover of the psbA gene product. The psbA gene is under stronger selection 
for increased translation efficiency and is the most abundant thylakoid protein. In addition, the 

20 codon usage in higher plant chloroplasts is biased towards the NNTC codon of 2-fold degenerate 
groups (i.e. TTC over TTT, GAC over GAT, CAC over CAT, AAC over AAT, ATC over ATT, 
ATA etc.). This is in addition to a strong bias towards T at third position of 4-fold degenerate 
groups. There is also a context effect that should be taken into consideration while modifying 
specific codons. The 2-fold degenerate sites immediately upstream from a GNN codon do not 

25 show this bias towards NNC. (TTT GGA is preferred to TTC GGA while TTC CGT is preferred 
to TTT CGT, TTC AGT to TTT AGT and TTC TCT to TTT TCT)(125b,126). In addition, 
highly expressed chloroplast genes use GNN more frequently that other genes, Codon 
composition was optimized by comparing different species. Abundance of amino acids in 
chloroplasts and tRNA anticodons present in chloroplast must be taken into consideration. We 

30 also compared A+T% content of all foreign genes that had been expressed in transgenic 
chloroplasts in our laboratory with the percentage of chloroplast expression. We found that 
higher levels of A+T always correlated with high expression levels (see table 1). It is also 
possible to modify chloroplast protease recognition sites while modifying codons, without 
affecting their biological functions. 

35 The study of the sequences of HSA, IGF-I and Interferon-GS was done. The HSA 
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ed 57% of A+T content and 40% of the total codons matched with the psbA most 
translated codons. According to the data of table 1 } we expected good chloroplast expression of 
the HSA gene without any modifications in its codon composition. IFN-D5 has 54% of A+T 
content and 40% of matching with psbA codons. The composition seems to be good but this 
5 protein is small (166 amino acids) and the sequence was optimized to achieve A+T levels close 
to 65%. Finally, the analysis of the IGF-I sequence showed that the A+T content was 40% and 
only 20% of the codons are the most translated in psbA. Therefore, this gene needed to be 
optimized. Optimization of these two genes is done using a novel PCR approach (127,128) 
which has been successfully used to optimize codon composition of other human proteins. 

10 

Example 5 

Vector constructions 

For all the constructs pLD vector is used. This vector was developed in this laboratory for 
chloroplast transformation. It contains the 16S rRNA promoter (Prm) driving the selectable 

15 marker gene aadA (aminoglycoside adenyl transferase conferring resistance to spectinomycin) 
followed by the psbA V region (the terminator from a gene coding for photosystem H reaction 
center components) from the tobacco chloroplast genome. The pLD vector is a universal 
chloroplast expression /integration vector and can be used to transform chloroplast genomes of 
several other plant species (73,86) because these flanking sequences are highly conserved among 

20 higher plants. The universal vector uses trnA and trnl genes (chloroplast transfer RNAs coding 
for Alanine and Isoleucine) from the inverted repeat region of the tobacco chloroplast genome as 
flanking sequences for homologous recombination. Because the universal vector integrates 
foreign genes within the Inverted Repeat region of the chloroplast genome, it should double the 
copy number of the transgene (from 5000 to 10,000 copies per cell in tobacco). Furthermore, it 

25 has been demonstrated that homoplasmy is achieved even in the first round of selection in 
tobacco probably because of the presence of a chloroplast origin of replication within the 
flanking sequence in the universal vector (thereby providing more templates for integration). 
Because of these and several other reasons, foreign gene expression was shown to be much 
higher when the universal vector was used instead of the tobacco specific vector (88). 

30 

The following vectors are used to optimize protein expression, purification and production of 
proteins with the same amino acid composition as in human proteins. 

a) In order to optimize expression, translation is increased using the psbA 5'UTR and 
35 optimizing the codon composition for protein expression in chloroplasts according to criteria 
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:>reviously. The 200 bp tobacco chloroplast DNA fragment containing 5' psbA 
UTR is amplified by PCR using tobacco chloroplast DNA as template. This fragment is 
cloned directly in the pLD vector multiple cloning site (EcoRI-NcoI) downstream of the 
promoter and the aadA gene. The cloned sequence is exactly the same as in the psbA gene. 

5 

b) For enhancing protein stability and facilitating purification, the cry2Aa2 Bacillus 
tkuringiensis operon derived putative chaperonin is used Expression of the'c?y2Aa2 operon 
in chloroplasts provides a model system for hyper-expression of foreign proteins (46% of 
total soluble protein) in a folded configuration enhancing their stability and facilitating 

10 purification (3). This justifies inclusion of the putative chaperonin from the cry2Aa2 operon 

in one of the newly designed constructs. In this region there are two open reading frames 
(ORF1 and ORF2) and a ribosomal binding site (rbs). This sequence contains elements 
necessary for Cry2Aa2 crystallization which help to crystallize the HSA, IGF-I and IFN-D 
proteins aiding in the subsequent purification. Successful crystallization of other proteins 

1 5 using this putative chaperonin has been demonstrated (94). We amplify the ORF1 and ORF2 

of the Bt Cty2Aa2 operon by PCR using the complete operon as template. The fragment is 
cloned into a PCR 2. 1 vector and excised as an EcoRI-EcoRV product. This fragment is then 
cloned directly into the pLD vector multiple cloning site (EcoRI-EcoRV) downstream of the 
promoter and the aadA gene. 

20 

c) To obtain proteins with the same amino acid composition as mature human proteins, we first 
fuse all three genes (codon optimized and native sequence) with the RuBisCo small summit 
transit peptide. Also other constructions are done to allow cleavage of the protein after 
isolation from chloroplast These strategies also allow affinity purification of the proteins. 

25 

The first set of constructs includes the sequence of each protein beginning with an ATG, 
introduced by PCR using primers. Processing to get the mature protein may be performed where 
the ATG is shown to be a problem (determined by mice immunological assays). First, we use 
the RuBisCo small subunit transit peptide. This transit peptide is amplified by PCR using 

30 tobacco DNA as template and cloned into the PCR 2.1 vector. All genes are fused with the 
transit peptide using a Mini restriction site that is introduced in the PCR primers for 
amplification of the transit peptide and genes coding for three proteins. The gene fusions are 
inserted into the pLD vectors downstream of the 5*UTR or ORF1+2 using the restriction sites 
Ncol and EcoRV respectively. If use of tags or protease sequences is necessary, such sequences 

35 can be introduced by designing primers including these sequences and amplifying the gene with 
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mpleting vector constructions, all the vectors are sequenced to confirm correct 
nucleotide sequence and in frame fusion. DNA sequencing is done using a Perkin Elmer ABI 
prism 373 DNA sequencing system. 

Because of the similarity of protein synthetic machinery (109), expression of all 
5 chloroplast vectors is first tested in E.coli before their use in tobacco transformation. For 
Escherichia coli expression XL-1 Blue strain is used. E. coli can be transformed by standard 
CaCb transformation procedures and grown in TB culture media. Purification, biological and 
immunogenic assays are done using E. coli expressed proteins. 

10 Example 6 

Bombardment, Regeneration and Characterization of Chloroplast Transgenic Plants 

Tobacco {Nicotiana tabacum var. Petit Havana) plants are grown aseptically by 
germination of seeds on MSO medium. This medium contains MS salts (4.3 g/liter), B5 vitamin 
mixture (myo-inositol, 100 mg/liter; thiamine-HCl, 10 mg/liter, nicotinic acid, 1 mg/liter, 

15 pyridoxine-HCI, 1 mg/liter), sucrose (30 g/liter) and phytagar (6 g/liter) at pH 5.8. Fully 
expanded, dark green leaves of about two month old plants are used for bombardment. 

Leaves are placed abaxial side up on a Whatman No. 1 filter paper laying on the RMOP 
medium (79) in standard petri plates (100x15 mm) for bombardment. Gold (0.6 um) 
microprojectiles are coated with plasmid DNA (chloroplast vectors) and bombardments are 

20 carried out with the biolistic device PDSIOOO/He (Bio-Rad) as described by Daniell (110). 
Following bombardment, petri plates are sealed with parafilm and incubated at 24°C under 12 h 
photoperiod. Two days after bombardment, leaves axe chopped into small pieces of —5 mm 2 in 
size and placed on the selection medium (RMOP containing 500 jxg/ml of spectinomycin 
duiydrochloride) with abaxial side touching the medium in deep (100x25 mm) petri plates (^10 

25 pieces per plate). The regenerated spectinomycin resistant shoots are chopped into small pieces 
(—2mm 2 ) and subcloned into fresh deep petri plates (-5 pieces per plate) containing the same 
selection medium. Resistant shoots from the second culture cycle are then transferred to the 
rooting medium (MSO medium supplemented with IB A, 1 mg/liter and spectinomycin 
dihydrochloride, 500 mg/liter). Rooted plants are transferred to soil and grown at 26°C under 16 

30 hour photoperiod conditions for further analysis. 

PCR analysis of putative transformants 

PCR is done using DNA isolated from control and transgenic plants in order to 
distinguish a) true chloroplast transformants from mutants and b) chloroplast transform ants from 
35 nuclear transformants. Primers for testing the presence of the aadA gene (that confers 
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resistance) in transgenic plants are landed on the aadA coding sequence and 16S 
rRNA gene. In order to test chloroplast integration of the genes, one primer lands on the aadA 
gene while another lands on the native chloroplast genome. No PCR product is obtained with 
nuclear transgenic plants using this set of primers. The primer set is used to test integration of the 

5 entire gene cassette without any internal deletion or looping out during homologous 
recombination. Similar strategy was used successfully to confirm chloroplast integration of 
foreign genes (3,85-88), This screening is essential to elrniinate mutants and nuclear 
transformants. In order to conduct PCR analyses in transgenic plants, total DNA from 
unbombarded and transgenic plants is isolated as described by Edwards et al. (129). Chloroplast 

10 transgenic plants containing the desired gene are then moved to second round of selection in 
order to achieve homoplasrny. 



Southern Analysis for homoplasrny and copy number 

Southern blots are done to determine the copy number of the introduced foreign gene per 

15 ceil as well as to test homoplasrny. There are several thousand copies of the chloroplast genome 
present in each plant cell. Therefore, when foreign genes are inserted into the chloroplast 
genome, some of the chloroplast genomes have foreign genes integrated while others remain as. 
the wild type (heteroplasmy). Therefore, in order to ensure that only the transformed genome 
exists in cells of transgenic plants (homoplasrny), the selection process is continued. In order to 

20 confirm that the wild type genome does not exist at the end of the selection cycle, total DNA 
from transgenic plants are probed with the chloroplast border (flanking) sequences (the trnl-trnA 
fragment). When wild type genomes are present (heteroplasmy), the native fragment size is 
observed along with transformed genomes. Presence of a large fragment (due to insertion of 
foreign genes within the flanking sequences) and absence of the native small fragment confirms 

25 homoplasrny (85,86,88). 

The copy number of the integrated gene is determined by establishing homoplasrny for 
the transgenic chloroplast genome. Tobacco chloroplasts contain 5000-10,000 copies of their 
genome per cell (86). If only a fraction of the genomes are actually transformed, the copy 
number, by default, must be less than 10,000. By establishing that in the transgenics the gene 

30 inserted transformed genome is the only one present, one can establish that the copy number is 
5000-10,000 per cell. This is usually done by digesting the total DNA with a suitable restriction 
enzyme and probing with the flanking sequences that enable homologous recombination into the 
chloroplast genome. The native fragment present in the control should be absent in the 
transgenics. The absence of native fragment proves that only the transgenic chloroplast genome 

35 is present in the cell and there is no native, untransformed, chloroplast genome, without the 
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present. This establishes the homopksmic nature of our transformants, 
simultaneously providing us with an estimate of 5000-10,000 copies of the foreign genes per 
cell. 

5 Northern Analysis for transcript stability 

Northern blots are done to test the efficiency of transcription of the genes. Total RNA is 
isolated from 150 rng of frozen leaves by using the "Rneasy Plant Total RNA Isolation Kit" 
(Qiagen Inc., Chatsworth, CA). RNA (10-40 jug) is denatured by formaldehyde treatment, 
separated on a 1.2% agarose gel in the presence of formaldehyde and transferred to a 
10 nitrocellulose membrane (MSI) as described in Sambrook et al. (130). Probe DNA (proinsulin 
gene coding region) is labeled by the random-primed method (Promega) with 32 P-dCTP isotope. 
The blot is pre-hybridized, hybridized and washed as described above for southern blot analysis. 
Transcript levels are quantified by the Molecular Analyst Program using the GS-700 Imaging 
Densitometer (Bio-Rad, Hercules, CA). 

15 

Expression and quantification of the total protein expressed in chloroplast 

Chloroplast expression assays are done for each protein by Western Blot. Recombinant 
protein levels in transgenic plants are determined using quantitative ELISA assays. A standard 
curve is generated using known concentrations and serial dilutions of recombinant and native 
20 proteins. Different tissues are analyzed using young, mature and old leaves against these primary 
antibodies: goat anti-HSA (Nordic Immunology), anti-IGF-I and anti-Interferon alpha (Sigma). 
Bound IgG is measured using horseradish peroxidase-labelled anti-goat IgG. 

Inheritance of Introduced Foreign Genes 

25 While it is unlikely that introduced DNA would move from the chloroplast genome to 

nuclear genome, it is possible that the gene could get integrated in the nuclear genome during 
bombardment and remain undetected in Southern analysis. Therefore, in initial tobacco 
transformants, some are allowed to self-pollinate, whereas others are used in reciprocal crosses 
with control tobacco (transgenics as female accepters and pollen donors; testing for maternal 

30 inheritance). Harvested seeds (Tl) will be genninated on media containing spectinomycin. 
Achievement of homoplasmy and mode of inheritance can be classified by looking at 
germination results. Homoplasmy is indicated by totally green seedlings (86) while heteroplasmy 
is displayed by variegated leaves (lack of pigmentation, 83). Lack of variation in chlorophyll 
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iniong progeny also underscores the absence of position effect, an artifact of 
nuclear transformation. Maternal inheritance is be demonstrated by sole transmission of 
introduced genes via seed generated on transgenic plants, regardless of pollen source (green 
seedlings on selective media). When transgenic pollen is used for pollination of control plants, 
5 resultant progeny do not contain resistance to chemical in selective media (will appear bleached; 
83). Molecular analyses confirm transmission and expression of introduced genes, and T2 seed is 
generated from those confirmed plants by the analyses described above. 

Example 7 
1 0 Purification methods 

The standard method of purification employs classical biochemical techniques with the 
crystallized proteins inside the chloroplast In this case, the homogenates are passed through 
miracloth to remove cell debris. Centrifugation at 10,000 xg pelletizes all foreign proteins (3). 
Proteins are solubilized using pH, temperature gradient, etc. This is possible if the ORF1 and 2 
15 of the cry2Aa2 operon (see section c) can fold and crystallize the recombinant proteins as 
expected. Were there is no crystal formation, other purification methods must be used (classical 
biochemistry techniques and affinity columns with protease cleavage). 

HSA: Albumin is typically administered in tens of gram quantities. At a purity level of 99.999% 
20 (a level considered sufficient for other recombinant protein preparations), recombinant HSA 
(rHSA) impurities on the order of one mg will still be injected into patients. So impurities from 
the host organism must be reduced to a minimum. Furthermore, purified rHSA must be identical 
to human HSA. Despite these stringent requirements, purification costs must be kept low. To 
purify the HSA obtained by gene manipulation, it is not appropriate to apply the conventional 
25 processes for purifying HSA originating in plasma as such. This is because the impurities to be 
eliminated from rHSA completely differ from those contained in the HSA originating in plasma. 
Namely, rHSA is contaminated with, for example, coloring matters characteristic to recombinant 
HSA, proteins originating in the host cells, polysaccharides, etc. In particular, it is necessary to 
sufficiently eliminate components originating in the host cells, since they are foreign matters for 
30 living organisms including human and can cause the problem of antigenicity. 

In plants two different methods of HSA purification have been done at laboratory scale. 
Sijmons et al. (23) transformed potato and tobacco plants with Agrobacterium tumefaciens. For 
the extraction and purification of HSA, 1000 g of stem and leaf tissue was homogenized in 1000 
35 ml cold PBS, 0.6% PVP, 0.1 mM PMSF and 1 mM EDTA. The homogenate was clarified by 
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rifuged and the supernatant incubated for 4 h with 1 .5 ml polyclonal antiHSA 
coupled to Reactigel spheres (Pierce Chem) in the presence of 0.5% Tween 80. The complex 
HSA~anti HSA-Reactigel was collected and washed with 5 ml 0.5% Tween 80 in PBS. HSA was 
desorbed from the reactigel complex with 2.5 ml of 0.1 M glycine pH 2.5, 10% dioxane, 
5 immediately followed by a buffer exchange with Sephadex G25 to 50 mM Tris pH 8. The 
sample was then loaded on a HR5/5 MonoQ anion exchange column (Pharmacia) and eluted 
with a linear NaCl gradient (0-350 mM NaCl in 50 mM Tris pH 8 in. 20 min at lrnl/min). 
Fractions containing the concentrated HSA (at 290 mM NaCl) were ryophilized and applied to a 
HR 10/30 Sepharose 6 column (Pharmacia) in PBS at 0.3 ml/min. However, this method uses 
10 affinity columns (polyclonal anti-HSA) that are very expensive to scale-up. Also the protein is 
released from the column with 0.1M glycine pH 2.5 that will most probably, denature the 
protein. Therefore, this method can suitably modified to reduce these drawbacks.- 

The second method is for HSA extraction and purification from potato tubers (Dr. 

15 Mingo-Castel's laboratory). After grinding the tuber in phosphate buffer pH 7.4 (1 rng/2ml), the 
homogenate is filtered in miracloth and centrifuged at 14.000 rpm 15 minutes. After this step 
another filtration of the supernatant in 0.45 Dm filters is necessary. Then, chromatography of 
ionic exchange in FPLC using a DEAE Sepharose Fast Flow column (Amersham) is required. 
Fractions recovered axe passed through an affinity column (Blue Sepharose fast flow Amersham) 

20 resulting in a product of high purity. HSA purification based on either method is acceptable. 

IGF-1; All earlier attempts to produce IGF-I in E. colt or Saccharomyces cerevisiae have 
resulted in misfolded proteins. This has made it necessary to perform additional in vitro refolding 
or extensive separation techniques in order to recover the native and biological form of the 
25 molecule. In addition, IGF-I has been demonstrated to possess an mtrinsic thermodynamic 
folding problem with regard to quantitatively folding into a native disulfide-bonded 
conformation in vitro (131). Samuelsson et aL (131) and Joly et al. (132) co-expressed IGF-I 
with specific proteins of E. colt that significantly improved the relative yields of correctly folded 
protein and consequently facilitating purification. Samuelsson et al. (132) fused the protein to 
affinity tags based on either the IgG-binding domain (Z) from Staphylococcal protein A or the 
two serum albumin domains (ABP) from Streptococcal protein G (134). The fusion protein 
concept allows the IGF-I molecules to be purified by IgG or HSA affinity chromatography. We 
also use this Z tags for protein purification including the double Z domain from & aureus protein 
and a sequence recognized by TEV protease (see section d.2). The fusion protein is incubated 
with an IgG column where binding via the Z domain occurs. Z domain-IgG interaction is very 



